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CHEMICALLY MODIFIED PROTEINS WITH A CARBOHYDRATE MOIETY 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Patent Application 
Serial No. 60/091,687, filed July 2, 1998, and U.S. Provisional Patent Application Serial 
5 No. 60/13 1,446, filed April 28, 1999, and which are hereby incorporated by reference. 

FIELD OF THE INVENTION 

The present invention relates to chemically modified* mutant proteins 
having modified glycosylation patterns with respect to a precursor protein from which 
they are derived. In particular, the present invention relates to a chemically modified 
mutant protein including a cysteine residue substituted for a residue other than cysteine in 
a precursor protein, the substituted cysteine residue being subsequently modified by 
reacting the cysteine residue with a glycosylated thiosulfonate. The present invention 
also relates to a method of producing the chemically modified mutant proteins and a 
glycosylated methanethiosulfonate. Another aspect of the present invention is a method 
of modifying the functional characteristics of a protein by reacting the protein with a 
glycosylated methanethiosulfonate reagent. The present invention also relates to methods 
of determining the structure-function relationships of chemically modified mutant 
proteins. 

BACKGROUND OF THE INVENTION 

Modifying enzyme properties by site-directed mutagenesis has been 
limited to natural amino acid replacements, although molecular biological strategies for 
overcoming this restriction have recently been derived (Cornish et ah, Angew. Chem. , Int. 
25 Ed. Engl., 34:621-633 (1995)). However, the latter procedures are difficult to apply in 
most laboratories. In contrast, controlled chemical modification of enzymes offers broad 
potential for facile and flexible modification of enzyme structure, thereby opening up 
extensive possibilities for controlled tailoring of enzyme specificity. 
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Changing enzyme properties by chemical modification has been explored 
previously, with the first report being in 1966 by the groups of Bender (Polgar et al., J. 
Am. Chem. Soc , 88:3 153-3 154 (1966)) and Koshland (Neet et aL, Proc. Natl. Acad. Sci. 
USA , 56:1606-161 1 (1966)), who created a thiolsubtilisin by chemical transformation 
5 (CH 2 OH —> CH 2 SH) of the active site serine residue of subtilisin BPN' to cysteine. 
Interest in chemically produced artificial enzymes, including some with synthetic 
potential, was renewed by Wu (Wu et aL, J. Am. Chem. Soc . 1 1 1:4514-4515 (1989); Bell 
( , et al., Biochemistry, 32:3754-3762 (1993)) and Peterson (Peterson et aL, Biochemistry , 

34:6616-6620 (1995)), and, more recently, Suckling (Suckling et aL, Bioorg. Med. Chem. 
B 10 Lett,, 3:531-534 (1993)). 

Enzymes are now widely accepted as useful catalysts in organic synthesis. 
However, natural, wild-type, enzymes can never hope to accept all structures of synthetic 

\-:.-3 

s chemical interest, nor always be transformed stereospecificaily into the desired 

fy enantiomerically pure materials needed for synthesis. This potential limitation on the 

JT 1 5 synthetic applicabilities of enzymes has been recognized, and some progress has been 

O made in altering their specificities in a controlled manner using the site-directed and 

random mutagenesis techniques of protein engineering. However, modifying enzyme 
properties by protein engineering is limited to making natural amino acid replacements, 
and molecular biological methods devised to overcome this restriction are not readily 
20 amenable to routine application or large scale synthesis. The generation of new 

specificities or activities obtained by chemical modification of enzymes has intrigued 
chemists for many years and continues to do so. 

U.S. Patent No. 5,208,158 to Bech et al. ("Bech") describes chemically 
modified detergent enzymes where one or more methionines have been mutated into 
25 cysteines. The cysteines are subsequently modified in order to confer upon the enzyme 
improved stability towards oxidative agents. The claimed chemical modification is the 
replacement of the thiol hydrogen with C,. 6 alkyl. 

Although Bech has described altering the oxidative stability of an enzyme 
through mutagenesis and chemical modification, it would also be desirable to develop one 
30 or more enzymes with altered properties such as activity, nucleophile specificity, 

substrate specificity, stereoselectivity, thermal stability, pH activity profile, and surface 



binding properties for use in, for example, detergents or organic synthesis. In particular, 
enzymes, such as subtilisins, tailored for peptide synthesis would be desirable. Enzymes 
useful for peptide synthesis have high esterase and low amidase activities. Generally, 
subtilisins do not meet these requirements and the improvement of the esterase to amidase 
selectivities of subtilisins would be desirable. However, previous attempts to tailor 
enzymes for peptide synthesis by lowering amidase activity have generally resulted in 
dramatic decreases in both esterase and amidase activities. Previous strategies for 
lowering the amidase activity include the use of water-miscible organic solvents (Barbas 
et al., J. Am. Chem. Soc , 1 10:5162-5166 (1988); Wong et aL, J. Am. Chem. Soc ., 
1 12:945-953 (1990); and Sears et aL, Biotechnol. Prog. , 12:423-433 (1996)) and site- 
directed mutagenesis (Abrahamsen et al., Biochemistry , 30:4151-4159 (1991); Bonneau 
et aL, "Alteration of the Specificity of Subtilisin BPN' by Site-Directed Mutagenesis in its 
SI and SI' Binding-Sites" J. Am. Chem. Soc , 113:1026-1030 (1991); and Graycar et aL, 
Ann. N. Y. Acad. ScL , 67:71-79 (1992)). However, while the ratios of esterase-to- 
amidase activities were improved by these approaches, the absolute esterase activities 
were lowered concomitantly. Abrahamsen et aL, Biochemistry , 30:4151-4159 (1991). 
Chemical modification techniques (Neet et aL, Proc. Nat. Acad. ScL , 56:1606 (1966); 
Polgar et aL, J. Am. Chem. Soc , 88:3153-3154 (1966); Wu et aL, J. Am. Chem. Soc , 
111:451 4-45 1 5 (1989); and West et aL, J. Am. Chem. Soc , 112:531 3-5320 ( 1 990)), 
which permit the incorporation of unnatural amino acid moieties, have also been applied 
to improve esterase to amidase selectivity of subtilisins. For example, chemical 
conversion of the catalytic triad serine (Ser221) of subtilisin to cysteine (Neet et aL, Proc 
Nat. Acad. ScL , 56:1606 (1966); Polgar et aL, J. Am. Chem. Soc , 88:3153-3154 (1966); 
and Nakatsuka et aL, J. Am. Chem. Soc , 109:3808-3810 (1987)) or to selenocysteine 
(Wu et aL, J. Am. Chem. Soc , 111:4514-4515 (1989)), and methylation ofthe catalytic 
triad histidine (His57) of chymotrypsin (West et aL, J. Am. Chem. Soc , 1 12:5313-5320 
(1990)), effected substantial improvement in esterase-to-amidase selectivities. 
Unfortunately however, these modifications were again accompanied by 50- to 1000-fold 
decreases in absolute esterase activity. 

Surface glycoproteins act as markers in cell-cell communication events 
that determine microbial virulence (Sharon et aL, Essays Biochem. , 30:59-75 (1995)), 



inflammation (Lasky, Annu. Rev. Biochem. . 64:1 13-139 (1995); Weis et al., Annu. Rev. 
Biochem. , 65:441-473 (1996)), and host immune responses (Varki, Glycobiol. , 3:97-130 

(1993) ; Dwek, Chem. Rev. , 96:683-720 (1996)). In addition, the correct glycosylation of 
proteins is critical to their expression and folding (Helenius, Mol. Biol. Cell. 5:253-265 

(1994) ) and increases their thermal and proteolytic stability (Opdenakker et al., FASEB J., 
7:1330-1337 (1993)). Glycoproteins occur naturally in a number of forms (glycoforms) 
(Rademacher et al, Annu. Rev. Biochem. . 57:785-838 (1988)) that possess the same 
peptide backbone, but differ in both the nature and site of glycosylation. The differences 
exhibited (Rademacher et al., Annu. Rev. Biochem. , 57:785-838 (1988); Parekh et al., 
Biochem. . 28:7670-7679 (1989); Knight, Biotechnol. . 7:35-40 (1989)) by each 
component within these microheterogeneous mixtures present regulatory difficulties (Liu, 
Trends Biotechnol. . 10:1 14-120 (1992); Bill et al., Chem. Biol. . 3:145-149 (1996)) and 
problems in determining exact function. To explore these key properties, there is a 
pressing need for methods that will not only allow the preparation of pure glycosylated 

5 proteins, but will also allow the preparation of non-natural variants for the determination 
of structure-function relationships, such as structure-activity relationships (SARs). The 
few studies that have compared single glycoforms successfully have required abundant 
sources and extensive chromatographic separation (Rudd et al., Biochem., 33:17-22 
(1994)). Neoglycoproteins (Krantz et al., Biochem. , 15:3963-3968 (1976)), formed via 

20 unnatural linkages between sugars and proteins, provide an invaluable alternative source 
of carbohydrate-protein conjugates (For reviews see Stowell et al, Adv. Carbohydr. 
Chem. Biochem.. 37:225-281 n980V Neoglvcoconiugates: Preparation and Applicat ions, 
Lee et al., Eds., Academic Press, London (1994); Abelson et al., Methods Enzymol., 242: 
(1994); Lee et al., Methods Enzymol. , 247: (1994); Bovin et al., Chem. Soc. Rev., 

25 24:413-421 (1995)). In particular, chemical glycosylation allows control of the glycan 
structure and the nature of the sugar-protein bond. However, despite these advantages, 
existing methods for their preparation (Stowell et al., Adv. Carbohydr. Chem. Biochem., 
37:225-281 (1980)) typically generate mixtures. In addition, these techniques may alter 
the overall charge of the protein (Lemieux et al., J. Am. Chem. Soc, 97:4076-4083 - 

30 (1975); Kobayashi et al., Methods Enzvmol. . 247:409-418 (1994)) or destroy the cyclic 
nature of glycans introduced (Gray, Arch. Biochem. Biophvs. , 163:426-428 (1974)). For 



example, the reductive amination of lactose with bovine serum albumin (BSA) caused 
indiscriminate modification of lysine residues through the formation of acyclic amines 
introduced (Gray, Arch. Biochem. Biophvs. . 163:426-428 (1974)). Advances in the site- 
specific glycosylation of BSA have been made (Davis et al., Tetrahedron Lett., 32:6793- 
6796 (1991); Wong et al.. Biochem. J. , 300:843-850 (1994); Macindoe et al., I Chem. 
Soc. Chem. Commun.. 847-848 (1998)). However, these methods rely upon modification 
of an existing cysteine in BSA and, as such, allow no flexibility in the choice of 
glycosylation site. Glycoproteins occur naturally as complex mixtures of differently 
glycosylated forms which are difficult to separate. To explore their properties, there is a 
need for homogenous sources of carbohydrate-protein conjugates. Existing methods 
typically generate product protein mixtures of poorly characterized composition, with 
little or no control over the site or level of glycosylation. 

The present invention is directed to overcoming these deficiencies. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide for novel glycosylated 

proteins. 

It is a further object of the invention to provide for novel glycosylated 
proteins which have modified or improved functional characteristics. 

It is a further object of the invention to provide for a method of producing 
glycosylated proteins which have well defined properties, for example, by having 
predetermined glycosylation patterns. 

According to the present invention, a method is provided wherein the 
glycosylation pattern of a protein is modified in a predictable and repeatable manner. 
Generally, the modification of the protein occurs via reaction of a cysteine residue in the 
protein with a glycosylated thiosulfonate. 

Thus, in one composition aspect of the present invention, a chemically 
modified mutant protein is provided, wherein said mutant protein differs from a precursor 
protein by virtue of having a cysteine residue substituted for a residue other than cysteine 
in said precursor protein, the substituted cysteine residue being subsequently modified by 
reacting said cysteine residue with a glycosylated thiosulfonate. Preferably, the 
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glycosylated thiosulfonate is an alkylthiosulfonate, most preferably a 
methanethiosulfonate. 

In a method aspect of the present invention, a method of producing a 
chemically modified mutant protein is provided comprising the steps of: (a) providing a 
precursor protein; (b) substituting an amino acid residue other than cysteine in said 
precursor protein with a cysteine; (c) reacting said substituted cysteine with a 
glycosylated thiosulfonate, said glycosylated thiosulfonate comprising a carbohydrate 
moiety; and (d) obtaining a modified glycosylated protein wherein said substituted 
cysteine comprises a carbohydrate moiety attached thereto. Preferably, the glycosylated 
thiosulfonate is an alkylthiosulfonate, most preferably, a methanethiosulfonate. Also 
preferably, the substitution in said precursor protein is obtained by using recombinant 
DNA techniques by modifying a DNA encoding said precursor protein to comprise DNA 
encoding a cysteine at a desired location within the protein. 

The present invention also relates to novel glycosylated thiosulfonates. In 
a preferred embodiment, the glycosylated thiosulfonate is a methanethiosulfonate. In a 
most preferred embodiment, the glycosylated methanethiosulfonate comprises a chemical 
structure including: 

O 

II 

H 3 C— S— SR 
II 

O 

where R comprises -p-Glc, -Et-p-Gal, -Et-p-Glc, -Et-a-Glc, -Et-a-Man, -Et-Lac, 
-p-Glc(Ac) 2 , -p-Glc(Ac) 5 , -p-Glc(Ac) 4 , -Et-a-Glc(Ac) 2 , -Et-a-Glc(Ac) 3 , -Et-a-Glc(Ac) 4 , 
-Et-P-Glc(Ac) 2 , -Et-p-Gle(Ac) 35 -Et-P-Glc(Ac) 4 , -Et-a-Man(Ac) 3 , -Et-a-Man(Ac) 4 , 
-Et-p-Gal(Ac) 3 , -Et-p-Gal(Ac) 4 , -Et-Lac(Ac) 5 , -Et-Lac(Ac) 6 , or -Et-Lac(Ac) 7 . 

Another aspect of the present invention is a method of modifying the 
functional characteristics of a protein including reacting the protein with a glycosylated 
thiosulfonate reagent under conditions effective to produce a glycoprotein with altered 
functional characteristics as compared to the protein. Accordingly, the present invention 
provides for modified protein, wherein the protein comprises a wholly or partially 
predetermined glycosylation pattern which differs from the glycosylation pattern of the 
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protein in its precursor, natural, or wild type state and a method for producing such a 
modified protein. 

The present invention also relates to methods of determining the structure- 
function relationships of chemically modified mutant proteins. One method includes 
5 providing first and second chemically modified mutant proteins of the present invention, 
wherein the glycosylation pattern of the second chemically modified mutant protein 
differs from the glycosylation pattern of the first chemically modified mutant protein, 
H evaluating a functional characteristic of the first and second chemically modified mutant 

O proteins, and correlating the functional characteristic of the first and second chemically 

5 J 10 modified mutant proteins with the structures of the first and second chemically modified 

^ mutant proteins. Another method involves providing first and second chemically 

O modified mutant proteins of the present invention, wherein at least one different cysteine 

- 3 residue in the second chemically modified mutant protein is modified by reacting said 

: ,: cysteine residue with a glycosylated thiosulfonate, evaluating a functional characteristic 

15 of the first and second chemically modified mutant proteins, and correlating the 
y functional characteristic of the first and second chemically modified mutant proteins with 

the structures of the first and second chemically modified mutant proteins. 

The chemically modified mutant proteins of the present invention provide 
an alternative to site-directed mutagenesis and chemical modification for introducing 
20 unnatural amino acids into proteins. Moreover, the methods of the present invention 
allow the preparation of pure glycoproteins (i.e., not mixtures) with predetermined and 
unique structures. These glycoproteins can then be used to determine structure-function 
relationships (e.g., structure-activity relationships ("SARs")) of non-natural variants of 
the proteins. 

25 An advantage of the present invention is that it is possible to introduce 

predetermined glycosylation patterns into proteins in a simple and repeatable manner. 
This advantage provides an ability to modify critical protein characteristics such as 
partitioning, solubility, cell-cell signaling, catalytic activity, biological activity and 
pharmacological activity. Additionally, the methods of the present invention provide for 

30 a mechanism of "masking" certain chemically or biologically important protein sites, for 
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example, sites which are critical for immunological or allergenic response or sites which 
are critical to proteolytic degradation of the modified protein. 

Another advantage of the present invention is the ability to glycosylate a 
protein which is not generally glycosylated, or to modify the glycosylation pattern of a 
5 protein which is generally glycosylated. 

Another advantage of the present invention is to produce enzymes which 
have altered catalytic activity. In one specific example, the inventors herein have shown 
that it is possible to modify the substrate specificity of a protease to increase the esterase 
activity as compared to the amidase activity. Similarly, modifications of substrate 
10 specificity would be expected when utilizing the present invention with other enzymes. 

These and other advantages of the present invention are described in more 
detail in the following detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

15 Figure 1 shows dendrimer methanethio sulfonate ("MTS") reagents. 

Figure 2 shows the synthesis of a first generation glycodendrimer reagent 
which bears two D-mannose units on its termini and has one arm as a MTS which can be 
attached to a subtilisin Bacillus lentus cysteine mutant. 

Figure 3 shows the synthesis of highly-functionalized glycodendrimer- 

20 protein conjugates. 

Figure 4 shows two parallel synthetic approaches to modification of 
subtilisin Bacillus lentus with dendrimers. Both approaches allow the use of a large 
library of methanethiosulfonate reagents (R-SS0 2 Me) to cap the dendrimeric branches. 
The routes shown allow for the preparation of both dimeric and trimeric dendrimers 
25 Figure 5 shows peptide coupling catalyzed by an enzyme. 

Figure 6 shows the preparation of two types of glycosylating reagents from 
D-glucose (2a): the anomeric methanethiosulfonate la and the ethyl-tethered 
methanethiosulfonates lb, c, g, h. 

Figure 7 shows the preparation of the a-D-manno-MTS reagents Id and li, 
30 which are epimeric at C-2 relative to lb and lg, respectively, and the p-D-galacto-MTS 
reagents le and lj, epimeric at C-4 .relative to lc and lh, respectively. 
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Figure 8 shows the reaction of the thiol residue of cysteine introduced into 
subtilisin Bacillus lentus with glycomethanethiosulfonate reagents. 

Figure 9 shows the reaction of 2,3.4,6-tetra-O-acetyl-P-D-glucopyranosyl 
methanethiosulfonate (la) with subtilisin Bacillus lentus-H62C. -S156C, -S166C, - 
L217C. 

Figures 10A-D show deprotected glycan structure-proteolytic activity 
SARs of subtilisin Bacillus lentus cysteine mutants and glycos\ lated chemically modified 
mutant enzymes ("CMMs") relative to wild-type ("WT"). A break in the axis indicates 
that the value was not determined. At position 62, glycosylation partially restores the 
decrease in kJK AI caused by mutation to cysteine (R - H) (Figure 10A). At position 217, 
the 4-fold decrease in activity caused by mutation (R = H) is amplified to 6-fold lower 
than WT by glycosylation with untethered S-(5-Glc, but reduced to around 2.5-fold lower 
than WT by glycosylation with ethyl-tethered glycans (b-f) (Figure 10B). At position 
1 56 5 an arced variation in activity reaches a 3 -fold lower than WT minimum k c JK M at 
bulky lacto-CMM S156C-S-f (Figure IOC). At position 166, the 2.5-fold decrease in 
k c JK hi caused by mutation is amplified by glycosylation. kJK u decreases monotonically 
from S166C (R - H) to a value that is 3.8-fold lower than WT for S 1 66C-S-f (Figure 
10D). 

Figures 1 1 A-D show the acetylated glycan structure-proteolytic activity 
SARs of glycosylated chemically modified mutant enzymes relative to WT. For each 
glycan the number of acetate groups present is indicated by a label on the corresponding 
bar. A break in the axis indicates that the value was not determined. At positions 62 
(Figure 1 1 A), 217 (Figure 1 IB), and 166 (Figure 1 ID) an alternating trend in activity is 
observed as a result of the opposite effects of acetylation upon kJK M according to 
anomeric stereochemistry (see Figure 12). This results in a kJK M for N62C-S-g that is 
1 .1 -fold higher than WT. At position 156 (Figure 11 C), variations are slight and this is 
consistent with its surface exposed orientation. 

Figures 12 A-D show the variation in proteolytic activity of glycosylated 
chemically modified mutant enzymes of subtilisin Bacillus lentus upon acetylation of 
glycans. Comparison of the activity of acetylated with fully deprotected chemically 
modified mutant enzymes shows that at positions 62 (Figure 12 A) and 217 (Figure 12B) 
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acetylation enhances the activity of a-tethered chemically modified mutant enzymes but 
decreases that of (3-tethered. In contrast, at position 166 (Figure 12D), acetylation 
decreases the activity of a-tethered CMMs but increases that of p-tethered. Consistent 
with its surface exposed orientation, changes at position 156 (Figure 12C) are modest. 
For each glycan the number of acetate groups present is indicated by a label on the 
corresponding bar. A break in the axis indicates that the value was not determined. 

Figures 13A-D show esterase kJK s p for deprotected glyco-CMMs 
relative to WT. At position 62 (Figure 13 A), in the S, pocket, glycosylation leads to a 
series of enzymes that have similar activities that are 1.3- to 1.9-fold greater than WT. 
At position 217 (Figure 13B), in the S,' pocket, glycosylation also increases kJK M , to a 
maximum 3.5-fold greater than WT for L217C-SEtGal (-e). At position 156 (Figure 
13C), in the S, pocket, glycosylation leads to a reduction in kJK M . At position 166 
(Figure 13D), in the S, pocket, the dramatic loss of activity upon mutation to cysteine (R 
= H) is restored by glycosylation. All five S166C deprotected glyco-CMMs have similar 
k c JK u s that are 1.1- to 1.4-fold lower than WT. 

Figures 14A-D show the effect of acetylation on kJK M of glyco-CMMs. 
For each glycan the number of acetate groups present is indicated by a label on the 
corresponding bar. A break in the axis indicates that the value was not determined. At 
positions 62 (Figure 14A), in the S 2 pocket, and 217 (Figure 14B), in the S," pocket, the 
effect of acetylation is dependent on anomeric stereochemistry. At both sites, acetylation 
of a-linked sugars (-b,-d) leads to an increase in kJK u , whereas kJK M is decreased for 
p-linked sugars (-c,-e,-f). kJK M also increases as the number of acetates increases. 
Consistent with the surface-exposed nature of its side chain, acetylation at position 156 
(Figure 14C), in the S, pocket, has very little effect on kJK sl . At position 166 (Figure 
14D), in the S, pocket, the effects of acetylation are opposite to those observed at 
positions 62 and 217. Acetylation increases kJK. u of p-linked glyco-CMMs (-c,-e), 
while causing a decrease for the a-linked glyco-CMMs (-b,-d). 

Figures 15A-D show the E/A of deprotected glyco-CMMs relative to WT. 
A break in the axis indicates that the value was not determined. Glycosylation with 
deprotected reagents lb-f increases the E/A ratio in all cases. The greatest effects are 
observed at positions 62, in the S 2 pocket (Figure 15 A), and 217, in the S,' pocket (Figure 



15B), where the largest increases in E/A result in values up to 10.9-fold greater than WT. 
At both sites the E/A is higher for the (3-linked glyco-CMMs than the cc-linked ones. At 
positions 156 (Figure 15C) and 166 (Figure 15D), in the S s pocket, there is little variation 
in E/A. 

Figures 16A-D show the effect of introducing acetylated glycans on E/A. 
For each glycan the number of acetate groups present is indicated by a label on the 
corresponding bar. A break in the axis indicates that the value was not determined. 
Values greater than zero indicate acetylation increases E/A, negative values denote a 
reduction in E/A upon acetylation. 

Figure 17 shows the modeling of the high esterase activity of L217C-S- 
Glc(Ac) 3 . A minimized structure of L217C-S-Glc(Ac), of the active site of SBL showing 
the catalytic residues Ser221 5 His64. The carbon atoms of the triacetyated D-glucose 
moiety, which is bound to Leu217 via a disulfide bond, are numbered. The phenyl ring of 
product AAPF occupies the S 1 binding site and forms a crucial hydrogen bond (1.72 A) 
to Wat 127. This water molecule is further stabilized by a second hydrogen bond (1 .89 A) 
to the carbonyl O of the C-2 acetate group of glucose. 

Figure 18 shows the proposed acyl-enzyme intermediate of L217C-S- 
Glc(Ac) 3 . The carboxy terminus of AAPF forms a bond to the O y atom of Ser221. 
Wat 127 acting as the crucial deacylating nucleophilic water molecule, is stabilized 
through its hydrogen bond to the carbonyl group of the C-2 acetate group. 

DETAILED DESCRIPTION OF THE INVENTION 

According to the present invention, a method is provided wherein the 
glycosylation pattern of a protein is modified in a predictable and repeatable manner. 
Generally, the modification of the protein occurs via reaction of a cysteine residue in the 
protein with a glycosylated thiosulfonate. 

Thus, in one composition aspect of the present invention, a chemically 
modified mutant protein is provided, wherein said mutant protein differs from a -precursor 
protein by virtue of having a cysteine residue substituted for a residue other than cysteine 
in said precursor protein, the substituted cysteine residue being subsequently modified by 



reacting said cysteine residue with a glycosylated thiosulfonate. Preferably, the 
glycosylated thiosulfonate is an alkylthiosulfonate, most preferably, a 
methanethiosulfonate. 

In a method aspect of the present invention, a method of producing a 
chemically modified mutant protein is provided comprising the steps of: (a) providing a 
precursor protein; (b) substituting an amino acid residue other than cysteine in said 
precursor protein with a cysteine; (c) reacting said substituted cysteine with a 
glycosylated thiosulfonate, said glycosylated thiosulfonate comprising a carbohydrate 
moiety; and (d) obtaining a modified glycosylated protein wherein said substituted 
cysteine comprises a carbohydrate moiety attached thereto. Preferably, the glycosylated 
thiosulfonate is an alkylthiosulfonate, most preferably, a methanethiosulfonate. Also 
preferably, the substitution in said precursor protein is obtained by using recombinant 
DNA techniques by modifying a DNA encoding said precursor protein to comprise DNA 
encoding a cysteine at a desired location within the protein. The amino acid residues to 
be substituted with cysteine residues according to the present invention may be replaced 
using site-directed mutagenesis methods or other methods well known in the art. See, for 
example, PCT Publication No. WO 95/10615, which is hereby incorporated by reference. 

The present invention also relates to a glycosylated thiosulfonate. 
Preferably, the glycosylated thiosulfonate comprises methanethiosulfonate. More 
preferably, the methanethiosulfonate comprises the chemical structure: 

O 

li 

H 3 C— S— SR 
il 

O 

where R comprises -p-Glc, -Et-p-Gal, -Et-p-Glc, -Et-a-Glc, -Et-a-Man, -Et-Lac, 
-P-Glc(Ac) 2 , -p-Glc(Ac) 3 , -P-Glc(Ac) 4 , -Et-a-Glc(Ac) 2 , -Et-a-Glc(Ac) 3 , -Et-a-Glc(Ac) 4 , 
-Et-p-Glc(Ac) 2 , -Et-P-Glc(Ac) 3 , -Et-P-Glc(Ac) 4 , -Et-a-Man(Ac) 3? -Et-a-Man(Ac) 4 , 
-Et-p-Gal(Ac) 3 , -Et-P-Gal(Ac) 4 , -Et-Lac(Ac) 5 , -Et-Lac(Ac) 6 , or -Et-Lac(Ac) 7 . 

Another aspect of the present invention is a method of modifying the 
functional characteristics of a protein including providing a protein and reacting the 
protein with a glycosylated thiosulfonate reagent under conditions effective to produce a 
glycoprotein with altered functional characteristics as compared to the protein. 



Accordingly, the present invention provides for modified protein, wherein the protein 
comprises a wholly or partially predetermined glycosylation pattern which differs from 
the glycosylation pattern of the protein in its precursor, natural, or wild type state and a 
method for producing such a modified protein. As used herein, glycosylation pattern 
means the composition of a carbohydrate moiety. The present invention also relates to 
methods of determining the structure-function relationships of chemically modified 
mutant proteins. The first method includes providing first and second chemically 
modified mutant proteins of the present invention, wherein the glycosylation pattern of 
the second chemically modified mutant protein differs from the glycosylation pattern of 
the first chemically modified mutant protein, evaluating a functional characteristic of the 
first and second chemically modified mutant proteins, and correlating the functional 
characteristic of the first and second chemically modified mutant proteins with the 
structures of the first and second chemically modified mutant proteins. The second 
method involves providing first and second chemically modified mutant proteins of the 
present invention, wherein at least one different cysteine residue in the second chemically 
modified mutant protein is modified by reacting said cysteine residue with a glycosylated 
thiosulfonate, evaluating a functional characteristic of the first and second chemically 
modified mutant proteins, and correlating the functional characteristic of the first and 
second chemically modified mutant proteins with the structures of the first and second 
chemically modified mutant proteins. 

The chemically modified mutant proteins of the present invention provide 
a valuable source of carbohydrate-protein conjugates. Moreover, the methods of the 
present invention allow the preparation of pure and glycoproteins (i.e., not mixtures) with 
predetermined and unique structures. These glycoproteins can then be used to determine 
structure-function relationships (e.g., structure-activity relationships ("SARs")) of non- 
natural variants of the proteins. 

The protein of the invention may be any protein for which a modification 
of the glycosylation pattern thereof may be desirable. For example, proteins which are 
naturally not glycosylated may be glycosylated via the invention. Similarly, proteins 
which exist in a naturally glycosylated form may be modified so that the glycosylation 
pattern confers improved or desirable properties to the protein. Specifically, proteins 
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useful in the present invention are those in which glycosylation plays a role in functional 
characteristics such as, for example, biological activity, chemical activity, 
pharmacological activity, or immunological activity. 

Glycosylated proteins as referred to herein means moieties having 
carbohydrate components which are present on proteins, peptides, or amino acids. In the 
present invention, the glycosylation is provided, for example, as a result of reaction of the 
glycosylated thiosulfonate with the thiol hydrogen of a cysteine residue thereby producing 
an amino acid residue which has bound thereto the carbohydrate component present on 
the glycosylated thiosulfonate. 

Another aspect of the present invention is a method of modifying the 
functional characteristics of a protein including providing a protein and reacting the 
protein with a glycosylated thiosulfonate reagent under conditions effective to produce a 
glycoprotein with altered functional characteristics as compared to the protein. 

The functional characteristics of a protein which may be altered by the 
present invention include, but are not limited to, enzymatic activity, the effect on a human 
or animal body, the ability to act as a vaccine, the tertiary structure (i.e., how the protein 
folds), whether it is allergenic, its solubility, its signaling effects, its biological activity, 
and its pharmacological activity (Paulson, "Glycoproteins: What are the Sugar Chains 
For? 55 , Trends in Biochem. Sciences , 14:272-276 (1989), which is hereby incorporated by 
reference). The use of glycosylated thiosulfonates as thiol-specific modifying reagents in 
the method of the present invention allows virtually unlimited alterations of protein 
residues. In addition, this method allows the production of pure glycoproteins with 
predetermined and unique structures and, therefore, unique functional characteristics, with 
control over both the site and level of glycosylation. In particular, the method of 
modifying the functional characteristics of a protein allows the preparation of single 
glycoforms through regio- and glycan-specific protein glycosylation at predetermined 
sites. Such advantages provide an array of options with respect to modification of protein 
properties which did not exist in the prior art. The ability to produce proteins having very 
specific and predictable glycosylation patterns will enable the production of proteins 
which have known and quantifiable effects in chemical, pharmaceutical, immunological, 
or catalytic performance. For example, with knowledge of a specific problematic epitope, 
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it would be possible to construct a modified protein according to the present invention in 
which the epitope is masked by a carbohydrate moiety, thus reducing its allergenic or 
immunogenic response in a subject. As another example, where the solubility of a protein 
is problematic in terms of recovery or formulation in a pharmaceutical or industrial 
application, it may be possible, utilizing the present invention, to produce a protein which 
has altered solubility profiles thus producing a more desirable protein product. As 
another example, if a protein has particular problem of being proteolytically unstable in 
the environment in which it is to be used, then it may be possible to mask the proteolytic 
cleavage sites in the protein using the present invention to cover up such a site with a 
carbohydrate moiety. These examples are merely a few of the many applications of the 
present invention to produce improved proteins. 

In a preferred embodiment, the protein is an enzyme. The term "enzyme" 
includes proteins that are capable of catalyzing chemical changes in other substances 
without being changed themselves. The enzymes can be wild-type enzymes or variant 
enzymes. Enzymes within the scope of the present invention include pullulanases, 
proteases, cellulases, amylases, isomerases, lipases, oxidases, and reductases. Preferably, 
the enzyme is a protease. The enzyme can be a wild-type or mutant protease. Wild-type 
proteases can be isolated from, for example, Bacillus lentus or Bacillus amyloliquefaciens 
(also referred to as BPN'). Mutant proteases can be made according to the teachings of, 
for example, PCT Publication Nos. WO 95/10615 and WO 91/06637, which are hereby 
incorporated by reference. Functional characteristics of enzymes which are suitable for 
modification according to the present invention include, for example, enzymatic activity, 
solubility, partitioning, cell-cell signaling, substrate specificity, substrate binding, 
stability to temperature and reagents, ability to mask an antigenic site, physiological 
functions, and pharmaceutical functions (Paulson, "Glycoproteins: What are the Sugar 
Chains For?", Trends in Biochem. Sciences , 14:272-276 (1989), which is hereby 
incorporated by reference). 

The protein is modified so that a non-cysteine residue is substituted with a 
cysteine residue, preferably by recombinant means. Preferably, the amino acids replaced 
in the protein by cysteines are selected from the group consisting of asparagine, leucine, 
or serine. 



- 16- 



The terms ""thiol side chain group," "thiol containing group," and "thiol 
side chain" are terms which are can be used interchangeably and include groups that are 
used to replace the thiol hydrogen of a cysteine used to replace one of the amino acids in a 
protein. Commonly, the thiol side chain group includes a sulfur through which the thiol 
5 side chain groups defined above are attached to the thiol sulfur of the cysteine. 

The glycosylated thiosulfonates of the invention are those which are 
capable of reacting with a thiol hydrogen of a cysteine to produce a glycosylated amino 
acid residue. By glycosylated is meant that the thiosulfonate has bound thereto a sugar or 
carbohydrate moiety which can be transferred to a protein pursuant to the present 
10 invention. Preferably, the glycosylated thiosulfonates are glycosylated 

alkylthiosulfonates, most preferably, glycosylated methanethiosulfonates. Such 
glycosylated methanethiosulfonate have the general formula: 

O 

i! 

H 3 C— S— SR 
II 

O 

In particularly preferred embodiment, the methanethiosulfonate comprises 
15 an R group which comprises: -p-Glc, -Et-p-Gal 5 -Et-p-Glc, -Et-a-Glc, -Et-a-Man, 
^Et-Lac, -P-Glc(Ac) 2 , -p-Glc(Ac) 3 , -P-Glc(Ac) 4 , -Et-ct-Glc(Ac) 2 , -Et-a-Glc(Ac) 3 , 
-Et-a-Glc(Ac) 4 , -Et-P-Glc(Ac) 2 , -Et-p-Glc(Ac) 3 , -Et-p-Glc(Ac) 4 , -Et-a-Man(Ac) 3 , 
-Et-a-Man(Ac) 4 , -Et-p~Gal(Ac) 3 , -Et^-Gal(Ac) 4 , -Et-Lac(Ac) 5 , -Et-Lac(Ac) 6 , or -Et- 
Lac(Ac) 7 . 

20 In a preferred embodiment, the carbohydrate moiety of the present 

invention is a dendrimer moiety. Multiple functionalization of chemically modified 
mutant proteins can be achieved by dendrimer approaches, whereby multiple-branched 
linking structures can be employed to create poly-functionalized chemically modified 
mutant proteins. 

25 Highly branched molecules or dendrimers were first synthesized by Vogtle 

in 1978 (Buhleier et ah, Synthesis , 155-158 (1978), which is hereby incorporated by 
reference). The attachment of identical building blocks that contain branching sites to a 
central core may be achieved with a high degree of homogeneity and control. Each 
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branch contains a functional group which, after chemical alteration, may be connected to 
yet another branching building block. In this manner, layer after layer of branching 
rapidly generates highly-functionalized molecules. 

For instance, multiple glycosylation, including multiple mannose- 
5 containing chemically modified mutant proteins, and varied sugar moieties, can be 
created. The dendrimer reagent structures would include methanethiosulfonates with 
simple branching such as: 

CH 3 S0 2 S J # 



derived from pentaerythritol, to very complex branched dendrimer reagents (see Figure 
10 1). 

In particular, a first generation glycodendrimer reagent is synthesized as 
shown in Figure 2. This approach can be extended to cover larger dendrimers. More 
specifically, by leaving one "arm" of the glycodendrimer free for conversion to a 
methanethiosulfonate, the remaining arms can be further branched to synthesize highly- 
15 functionalized glycodendrimer reagents as shown in Figure 3. Through further branching 
and by using different carbohydrates, this concept can be extended to virtually unlimited 
levels. 

A flexible synthetic strategy for the synthesis of core dendrimeric 
methanethiosulfonate building blocks that may be used either in situ or before 
20 modification to construct dendrimers is shown in Figure 4. 

The present invention also relates to glycosylated thiosulfonate 
compositions- Preferably the glycosylated thiosulfonates are methanethiosulfonates and 
comprise a chemical structure: 

O 

II 

H 3 C— S— SR 
ii 

O 

25 wherein R comprises -p-Glc, -Et-(3-Gal, -Et-p-Glc, -Et-a-GIc, -Et-a-Man, -Et-Lac, -p- 
Glc(Ac) 2 , -P-Glc(Ac) 3 , -P-Glc(Ac) 4 , -Et-oc-Glc(Ac) 2 , -Et-a-Gic(Ac) 3 , -Et-a-Glc(Ac) 4 , -Et- 
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P-Glc(Ac),, -Et-p-Glc(Ac),, -Et-p-Glc(Ac) 4 , -Et-a-Man(Ac),, -Et-a-Man(Ac) 4 , -Et-p- 
Gal(Ac) 3 , -Et-p-Gal(Ac),, -Et-Lac(Ac) 5 , -Et-Lac(Ac) 6 , or -Et-Lac(Ac) 7 . 

The present invention also relates to a method of determining the structure- 
function relationships of chemically modified mutant proteins. This method involves 

5 providing first and second chemically modified mutant proteins of the present invention, 
wherein the glycosylation pattern of the second chemically modified mutant protein is 
different from the glycosylation pattern of the first chemically modified mutant protein, 
evaluating a functional characteristic of the first and second chemically modified mutant 
proteins, and correlating the functional characteristic of the first and second chemically 

l o modified mutant proteins with the structures of the first and second chemically modified 
mutant proteins. 

Evaluating a functional characteristic of the first and second chemically 
modified mutant protein includes testing for functional characteristics including, but not 
limited to, stability to temperature and reagents, solubility, partitioning, enzymatic 
1 5 activity, cell-cell signaling, substrate specificity, substrate binding, ability to mask an 
antigenic site, physiological functions, and pharmaceutical functions (Paulson, 
"Glycoproteins: What are the Sugar Chains For?", Trends in Bi ochem. Sciences, 14:272- 
276 (1989), which is hereby incorporated by reference). 

Another aspect of the present invention is a second method of determining 
20 the structure-function relationships of chemically modified mutant proteins. This method 
involves providing first and second chemically modified mutant proteins of the present 
invention, wherein at least one different cysteine residue in the second chemically 
modified mutant protein is modified by reacting said cysteine residue with a glycosylated 
thiosulfonate, evaluating a functional characteristic of the first and second chemically 
25 modified mutant proteins, and correlating the functional characteristic of the first and 
second chemically modified mutant proteins with the structures of the first and second 
chemically modified mutant proteins. 

By way of example to illustrate some of its advantages, the following 
discussion will focus on certain proteases which are modified according to the methods o 
30 the present invention. Alkaline serine proteases (subtilisins) are finding increasing use ir 
biocatalysis, particularly in chiral resolution, regioselective acylation of polyfunctional 
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compounds, peptide coupling, and glycopeptide synthesis. As shown in Figure 5, 
subtilisins can catalyze peptide bond formation starting from an ester substrate, by first 
forming an acyl enzyme intermediate which then reacts with a primary amine to form the 
peptide product. This application requires high esterase activity to promote acyl enzyme 
formation and low amidase activity to minimize hydrolysis of the peptide bond of the 
desired product. Generally, subtilisins do not meet these requirements. However, the 
improvement of the esterase to amidase selectivities of subtilisins has been a long sought 
after goal. By using the methods provided for in the present invention, it is possible to 
produce subtilisins which have advantageous properties. 

The inventors in the present case used site specific mutagenesis to modify 
certain residues and introduce additional cysteine residues within subtilisin which would 
then serve to react with a glycosylated methanethiosulfonate to produce a glycosylation 
point at the introduced cysteine. Bacillus lentus subtilisin was selected for illustrative 
purposes because it does not contain a natural cysteine and is not naturally glycosylated. 

The substrate binding site of an enzyme consists of a series of subsites 
across the surface of the enzyme. The portion of substrate that corresponds to the subsites 
are labeled P and the subsites are labeled S. By convention, the subsites are labeled S,, 
S 2 , S 3 , S 4 , S/, and S 2 *. A discussion of subsites can be found in Berger et al., Phil. Trans. 
Roy. Soc. Lond. B , 257:249-264 (1970), Siezen et al., Protein Engineering , 4:719-737 
(1991), and Fersht, Enzyme Structure and Mechanism , 2 ed., Freeman: New York, 29-30 
(1985), which are hereby incorporated by reference. 

In the present illustration, the S l3 S x \ or S 2 subsites were selected as 
suitable targets for modification. In particular, the amino acids corresponding to N62, 
L217, SI 56, and SI 66 in naturally-occurring subtilisin from Bacillus amyloliquefaciens 
or to equivalent amino acid residues in other subtilisins, such as Bacillus lentus subtilisin, 
were selected for modification to cysteine. The mutated subtilisin was produced through 
standard site directed mutagenesis techniques and the obtained mutant subtilisin was 
reacted with certain glycosylated alkylthiosulfonates, particularly glycosylated 
methanethiosulfonates, as provided in the examples appended hereto. 

Enzymatic peptide coupling is an attractive method for preparation of a 
variety of peptides, because this method requires minimal protection of the substrate. 
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proceeds under mild conditions, and does not cause racemization. Wong et al., Enzymes 
in Synthetic Organic Chemistry , Pergamon Press: Oxford, 41-130 (1994), which is hereby 
incorporated by reference. In spite of these advantages, two major problems have limited 
the use of serine proteases in peptide synthesis. One is their efficient proteolytic 
5 (amidase) activity which causes hydrolysis of the coupling product, and the other is their 
stringent structural specificity and stereospecificity. 

Surprisingly, it was found that the chemically modified mutant subtilisins 
of the present invention have altered esterase-to-amidase activity as compared to the 
precursor enzyme. Increasing the esterase-to-amidase ratio enables the use of the enzyme 
10 to more efficiently catalyze peptide synthesis. In particular, subtilisins can catalyze 
peptide bond formation starting from an ester substrate (i.e. an acyl donor), by first 
forming an acyl enzyme intermediate which then reacts with a primary amine (i.e. an acyl 
acceptor) to form the peptide product, as shown in Figure 5. This reaction thus requires 
high esterase activity to promote acyl enzyme formation and, then, low amidase activity 
15 to minimize hydrolysis of the peptide bond of the desired product. The chemically 
modified mutant subtilisins produced according to the present invention show an 
increased esterase-to-amidase ratio, without reducing the absolute esterase activity of the 
enzyme. In addition, certain modified enzymes of the present invention even show a 
concomitant increase in the absolute esterase activity. 
20 Therefore, an unexpected- benefit of subtilisins which are modified 

according to the present invention is that they can be used in organic synthesis to, for 
example, catalyze a desired reaction and/or favor a certain stereoselectivity. See e.g., 
Noritomi et al. Biotech. Bioeng. 51:95-99 (1996); Dabulis et al. Biotech. Bioeng. 41:566- 
571 (1993), and Fitzpatrick et al. J. Am. Chem. Soc. 1 13:3166-3171 (1991), which are 
25 hereby incorporated by reference. 

Proteins obtained using the methods provided herein may be used in any 
application in which it is desired to use such proteins, where having modified functional 
capabilities is advantageous. Thus, proteins modified as provided herein may be used in 
the medical field for pharmaceutical compositions and in diagnostic preparations. 
30 Additionally, proteins such as enzymes which are modified according to the present 
invention may be used in applications which are generally known for such enzymes 
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including industrial applications such as cleaning products, textile processing, feed 
modification, food modification, brewing of grain beverages, starch processing, as 
antimicrobials, and in personal care formulations. Moreover, the unique functionalities 
made possible by the present invention may result in uses for proteins which have not 
5 heretofore been recognized as feasible. 

EXAMPLES 

Example 1 -Preparation of Methanethiosulfonste ("MTS") Reagents 

10 The preparation of NaSS0 2 CH 3 (Kenyon et aL, Methods EnzymoL 

47:407-430 (1977), which is hereby incorporated by reference) has been described 
previously (Berglund et aL, J. Am. Chem. Soc , 1 19:5265-5266 (1997), which is hereby 
incorporated by reference). Acetobromoglucose (3) (See Figure 6) (prepared from D- 
glucose according to Scheurer et aL, J. Am. Chem. Soc , 76:3224 (1954), which is hereby 

1 5 incorporated by reference) in 73% yield, pentaacetylglucose (prepared from the 

corresponding parent carbohydrates according to the method of Verley et aL, Ber. Dtsch. 
Chem. Ges. , 34:3354-3358 (1901), which is hereby incorporated by reference, and 
purified by flash chromatography) in 99% yield, 5d (See Figure 7) (prepared from the 
corresponding parent carbohydrates according to the method of Verley et aL, Ber. Dtsch. 

20 Chem. Ges. , 34:3354-3358 (1901), which is hereby incorporated by reference, and 

purified by flash chromatography) in 92% yield, 5e (See Figure 7) (prepared from the 
corresponding parent carbohydrates according to the method of Verley et aL, Ber. Dtsch. 
Chem. Ges. , 34:3354-3358 (1901), which is hereby incorporated by reference, and 
purified by flash chromatography) in 99% yield, 5f (See Figure 7) (prepared from lactose 

25 according to the method of Hudson et aL, J. Am. Chem. Soc , 37:1270-1275 (1915), 

which is hereby incorporated by reference, and purified by flash chromatography in 82% 
yield) were prepared according to literature methods. N,N-dimethylformamide ("DMF") 
was distilled under N 2 from CaH 2 and stored over molecular sieve under N 2 before use. 
Methanol was distilled from Mg/T 2 under N 2 immediately prior to use. Br(CH 2 ) 2 OH was 

30 stood over and distilled from CaO under reduced pressure and stored under N 2 prior to 
use. All other chemicals were used as received from Sigma-Aldrich (St. Louis, MO) or 



- 22 - 



Baker (Phillipsburg, NJ). All flash chromatography was performed using silica gel 
(Whatman, 60 A, 230-400 Mesh, Clifton, NJ). Melting points were determined using an 
Electrothermal IA9000 series digital melting point apparatus and are uncorrected. IR 
spectra were recorded on Bomem MB or Perkin-Elmer FTIR Spectrum 1000 

5 spectrophotometers. ! H NMR and 1:> C NMR spectra were recorded on Varian Gemini 
200, Unity 400 or Unity 500 NMR spectrometers at the frequencies indicated. Where 
indicated, NMR peak assignments were made using Correlation Spectroscopy ("COSY") 
or Distortionless Enhanced by Polarization Transfer ("DEPT") experiments, all others are 
subjective. All chemical shifts were referenced to residual solvent as an internal standard; 

10 for !3 C NMR in D 2 0 1 ,4-dioxan (67.6 ppm) was used. ES-MS data were acquired using a 
PE SCIEX API III Biomolecular mass spectrometer. All HRMS data were acquired using 
Micromass 70-250S or Micromass ZAB-SE mass spectrometers according to the 
ionization methods indicated. Solvents were removed in vacuo. 

15 Preparation of2 t 3,4,6-Tetra-Q-Gcetyl-$-D-gIucopyranosyl methanethiosulfonate (la). 

Initial approaches to untethered glyco-MTS reagents similar in type to la 
(See Figure 8) were based upon Danishefsky's glycal methodology (Halcomb et ah, I 
Am. Chem. Soc , 1 1 1 :666 1-6666 (1989), which is hereby incorporated by reference). 
Tris-TBS glucal was prepared according to the method of Lesimple et aL, Tetrahedron 

20 Lett. , 27:6201-6204 (1986), which is hereby incorporated by reference, and oxidized to 
tris-TBS protected 1 ,2-anhydroglucose using dimethyldioxirane. However, under a 
variety of conditions and in contrast to the behavior of other sulfur nucleophiles (Gordon 
et aL, Carbohvdr. Res. , 206:361-366 (1990); Berkowitz et al, J. Am. Chem. Soc, 
1 14:4518-4529 (1992), which are hereby incorporated by reference), 

25 methanethiosulfonate ion failed to open the epoxide moiety of tris-TBS protected 1,2- 

anhydro-D-glucose. Deprotection of la (See Figure 8) was attempted under a variety of 
conditions, but in all cases led only to decomposition or hydrolysis of the thioglucosidic 
bond (Zemplen et aL, Ber. Dtsch. Chem. Ges. , 56:1705-1710 (1923); Plattner et aL, J. 
Am. Chem. Soc , 94:8613-8615 (1972); Mori et aL, Tetrahedron Lett. , 20:1329-1332 
30 (1979); Herzig et aL, Carbohvdr. Res. , 153:162-167 (1986); Herzig et aL, J. Org. Chem., 
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51:727-730 (1986); Vekemans et al., Tetrahedron Lett. , 28:2299-2300 (1987); Cinget et 
al., Svnlett , 168-170 (1993), which are hereby incorporated by reference). 

Acetobromoglucose (3) (See Figure 6) (1 g, 2.43 mmol) was added to a 
solution of NaSS0 2 CH 3 (380 mg, 2.84 mmol) in ethanol (4 mL) at 90°C under N 2 . After 

5 20 minutes the resulting suspension was cooled and the solvent removed. The residue 
was purified by flash chromatography (EtOAc : hexane, 9:1 1) and the resulting solid 
recrystallized from ether to give la (See Figure 8) (674 mg, 63%) as a white solid: mp 
151-152°C melts then decomp. (ether); [a] 27 D = -19.0 (c 1.24, CHC1 3 ); IR (KBr) 1749 cm" 
1 (DO), 1333, 1 140 cm"' (S-S0 2 ); 'H NMR (400 MHz, CDC1 3 ) 8 2.00, 2.04, 2.06, 2.07 (s 

10 x 4, 3H x 4, Ac x 4), 3.44 (s, 3H, CH 3 S0 2 -), 3.82 (ddd, J 4 , 5 10.1 Hz, J 5 , 6 5.9 Hz, J 56 - 2.2 
Hz, 1H, H-5), 4.08 (dd, J 5 . 6 5.9 Hz, J M . 12.5 Hz, 1H, H-6), 4.31 (dd, J SA . 2.2 Hz, J 66 - 12.5 
Hz, 1H, H-6'), 5.05 (t, J9.8 Hz, 1H, H-4), 5.07 (dd, J,. 2 10.5 Hz, J 2 . 3 9.4 Hz, 1H, H-2), 
5.25 (d, J,. 2 10.5 Hz,lH, H-l), 5.29 (t, J9.3 Hz, 1H, H-3); l3 C NMR (50 MHz, CDC1 3 ) 5 
20.5, 20.7 (CH 3 COO- x 4), 52.8 (CH 3 S0 2 -), 61.8, 68.0, 68.7, 73.3, 76.6 (C-2, C-3, C-4, 

15 C-5, C-6), 86.4 (C-l), 169.3, 169.3, 169.7, 170.1 (CH 3 COO- x 4); HRMS m/z (EI+): 
Found 443.0636 (M+H~); C 15 H 23 0„S 2 requires 443.0682. 

Preparation of2-(2, 3, 4, 6-Tetra-0-acetyl-a-D-glucopyranosyl)ethyl methanethiosulfonate 
(lg). 

20 BF 3 .Et 2 0 (145 uL, 1.1 mmol) was added dropwise to a suspension of D- 

glucose (2a) (See Figure 6) (1.45 g, 8.1 mmol) in Br(CH 2 ) 2 OH (19 mL) under N 2 and the 
resulting mixture heated to 105°C. After 8 hours, the resulting solution was cooled and 
the solvent removed. The residue was dissolved in Ac 2 0/pyridine (2:3 v/v, 16 mL) under 
N,. After a further 24 hours, the reaction solvent was removed and the residue purified by 

25 repeated flash chromatography (EtOAc then EtOAc : hexane, 3:7) to give 2-bromoethyl 
2,3,4,6-tetra-O-acetyl-a-D-glucopyranoside (4g) (See Figure 6) (1.76 g, 48%) as a 
colorless oil that crystallized on standing to give a white solid; mp 86-88°C; [a] 23 D = + 
130.6 (c 0.21, CHC1 3 ); IR (film) 1749 cm" 1 (CO); 'H NMR (400 MHz, CDC1 3 ) 5 2.01, 
2.03, 2.07, 2.09 (s x 4, 3H x 4, Ac x 4), 3.51 (t, J5.9 Hz, 2H, -CH 2 Br), 3.83 (dt, ^ 11.6 

30 Hz, J, 5.8 Hz, 1H, -OCHH'-), 3.96 (dt, J d 11.6 Hz, J, 5.8 Hz, 1H, -OCHH>), 4.10 (dd, J 5 , 6 
2.2 Hz, J 5 . 6 - 12.0 Hz, 1H, H-6), 4.14 (ddd, J 4 5 10.2 Hz, J 5 . 6 2.2 Hz, J 5 . 6 - 4.4 Hz, 1H, H-5), 
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4.24 (dd, J iA . 4.4 Hz, J 6S 12.0 Hz, 1H, H-6'), 4.84 (dd, J, 2 3.8 Hz, J 23 10.3 Hz, 1H, H-2), 
5.05 (t, J 9.7 Hz, 1H, H-4), 5.14 (d, J L2 3.8 Hz, 1H, H-l), 5.49 (dd, J 2 . 3 10.3 Hz, J 3 . 4 9.5 
Hz, 1H, H-3); ,3 C NMR (100 MHz, CDC1 3 ) 5 20.6, 20.7 (CH 3 COO- x 4), 29.9 (-CH 2 Br), 
61.9, 67.8, 68.5, 68.8, 70.0, 70.8 (-OCH 2 -, C-2, C-3, C-4, C-5, C-6), 96.0 (C-l), 169.6, 
170.0, 170.2, 170.6 (CH 3 COO- x 4); HRMS m/z (FAB+): Found 477.0381 (M+Na + ); 
C 16 H 23 O 10 BrNa requires 477.0372. NaSS0 2 CH 3 (75 mg, 0.56 mmol) was added to a 
solution of 4g (See Figure 6) (190 mg, 0.42 mmol) in DMF (6 mL) under N 2 and warmed 
to 50 °C. After 21 hours, the solution was cooled and the solvent removed. The residue 
was purified by flash chromatography (EtOAc : hexane, 1 : 1) to give lg (See Figure 6) 
(183 mg, 90%) as a colorless oil; [cc] 27 D = + 92.1 (c 0.39, CHC1 3 ); IR (film) 1748 cm" 1 
(C=0), 1322, 1134 cm" 1 (S-S0 2 ); 'H NMR (400 MHz, CDC1 3 ) 5 2.01, 2.03, 2.07, 2.09 (s x 
4, 3H x 4, Ac x 4), 3.41 (t, J5.7Hz, 2H, -CH 2 S-), 3.41 (s, 3H, CH 3 S0 2 -), 3.75 (dt, J d 10.8 
Hz, J t 5.7 Hz, 1H, -OCHH'-), 3.99-4.06 (m, 2H, H-5, -OCHH», 4.09 (dd, J 5 . 6 2.4 Hz, J 6 6 - 
12.6 Hz, 1H, H-6), 4.25 (dd, J 5 . 6 . 4.6 Hz, J 6 , 6 . 12.6 Hz, 1H, H-6'), 4.87 (dd, J K2 3.9 Hz, J 2 . 3 
10.3 Hz, 1H, H-2), 5.06 (t, J9.8 Hz, 1H, H-4), 5.12 (d, J l2 3.9 Hz, 1H, H-l), 5.43 (t, J 9.8 
Hz, 1H, H-3); ,3 C NMR (100 MHz, CDC1 3 ) 5 20.6, 20.6, 20.7, 20.7 (CH 3 COO- x 4), 36.0, 
(-CH 2 S-), 50.8 (CH 3 SO r ), 61.8 (-OCH,-), 67.0, 67.8, 68.3, 69.8, 70.7 (C-2, C-3, C-4, C-5, 
C-6), 96.0 (C-l), 169.5, 170.0, 170.6 (CH 3 COO- x 4); HRMS m/z (FAB+): Found 
487.0946 (M+H + ); C 17 H 27 0 12 S, requires 487.0944. 

Preparation of2-(a-D-Glucopyranosyl)ethyl methane thiosulfonate (lb). 

A solution of NaOMe (0.1M, 0.3 mL) was added to a suspension of 4g 
(See Figure 6) (300 mg, 0.66 mmol) in MeOH (3 mL) under N, and stirred vigorously. 
After 6 hours, the resulting solution was passed through a Dowex 50W(H~) plug (2 x 1 
cm, eluant MeOH) and the solvent removed to give 2-bromoethyl cc-D-glucopyranoside 
bromide (4b) (See Figure 6) (178 mg, 94%) (The synthesis of 4b as an intermediate has 
been described previously. However, this method gave only a poor yield of product. 
Nagai et al., Carbohvdr. Res. , 190:165-180 (1989), which is hereby incorporated by 
reference) as a white solid. NaSS0 2 CH 3 (100 mg, 0.75 mmol) was added to a solution of 
4b (See Figure 6) (178 mg, 0.62 mmol) in DMF (7 mL) under N 2 and warmed to 50 °C. 
After 25 hours, the solution was cooled and the solvent removed. The residue was 
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purified by flash chromatography (MeOH : EtOAc, 1:9) to give lb (See Figure 6) (144 
mg, 73%) as a hygroscopic foam; [a] 27 D = + 109.9 (c 1.1 1, H 2 0); IR (film) 3423 cm" 1 
(OH), 1309, 1128 cm"' (S-S0 2 ); 'H NMR (500 MHz, D 2 0, COSY) 5 3.16 (t, J 9.5 Hz, 1H, 
H-4), 3.28 (t, J 5.9 Hz, 2H, -CH 2 S-), 3.30 (s, 3H, CH 3 SO r ), 3.31 (dd, J u 3.8 Hz, J 23 9.9 
Hz, 1H, H-2). 3.44 (t, J 9.5 Hz, 1H, H-3), 3.47-3.53 (m, 2H, H-6, H-6'), 3.58-3.61 (m, 
1H, H-5), 3.62 (dt. J { 5.4 Hz. J d 10.8 Hz, 1H, -OCHH'-), 3.79 (dt. J t 6.3 Hz, J d 10.8 Hz, 
1H, -OCHH1). 4.72 (d. J, , 3.8 Hz, 1H, H-l); 13 C NMR (100 MHz. D 2 0) 8 36.7 (-CH 2 S-), 
50.7 (CH,S0 2 -), 61.5 (-OCH 2 -), 67.2, 70.5, 72.3, 73.2, 74.0 (C-2, C-3, C-4, C-5, C-6), 
99.4 (C-l); HRMS m/z (FAB+): Found 319.0517 (M+H~); C 9 H !9 0 8 S 2 requires 319.0521. 

Preparation of 2-(2, 3, 4. 6-Tetra-0-acetyl-$-D-glucopyranosyl)ethyl methanethiosulfonate 
(lh). 

BF 3 .Et 2 0 (3.3 mL, 26.0 mmol) was added dropwise over the course of 15 
minutes to a solution of 1,2,3, 4,6-penta-0-acetyl-a,(3-D-glucose (2 g, 5.1 mmol) and 
Br(CH 2 ) 2 OH (0.45 mL, 6.3 mmol) in CH 2 C1 2 (9 mL) at 0 °C under N 2 . After 1.5 hours, 
the solution was warmed to room temperature. After 20 hours the reaction solution was 
added to ice water (15 mL) and extracted with CH 2 C1 2 (15 mL x 3). These extracts were 
combined, washed with water (15 mL), sat. NaHC0 3 (aq., 15 mL), water (15 mL), dried 
(MgS0 4 ), filtered, and the solvent removed. The residue was purified by flash 
chromatography (EtOAc : hexane, 1:3) to give 2-bromoethyl 2,3,4,6-tetra-<9-acetyl-(3-D- 
glucopyranoside (4h) (See Figure 6) (1.42 g, 61%) as a white solid; mp 1 18-120 °C 
(EtOAc/zso-octane) [lit., (Coles et al., J. Am. Chem. Soc . 60:1020-1022 (1938), which is 
hereby incorporated by reference) mp 1 17.3 °C (EtOH)]; [a] 27 D = - 1 1.9 (c 1.65, CHC1 3 ) 
[lit., (Helferich et al., lust. Lieb. Ann. Chem. , 541:1-16 (1939), which is hereby 
incorporated by reference) [a] 20 D = - 12.3 (c 0.2, CHC1 3 )]; 'H NMR (200 MHz, CDC1 3 ) 5 
2.00, 2.02, 2.07, 2.09 (s x 4, 3H x 4, Ac x 4), 3.42-3.51 (m, 2H), 3.67-3.87 (m, 2H), 4.10- 
4.31 (m, 3H), 4.57 (d, J x2 8 Hz, 1H, H-l), 4.97-5.27 (m, 3H). NaSS0 2 CH 3 (260 mg, 1.94 
mmol) was added to a solution of 4h (See Figure 6) (640 mg, 1.41 mmol) in DMF (18 
mL) under N, and warmed to 50°C. After 25 hours, the solution was cooled and"the 
solvent removed. The residue was purified by flash chromatography (EtOAc : hexane, 
1:1) and the resulting solid recrystallized from EtOAc/hexane to give lh (See Figure 6) 
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(544 mg, 80%) as a white solid; mp 1 15-1 16°C (EtOAc/hexane); [a] 27 D = + 5.4 (c 1.06. 
CHC1 3 ); IR(KBr) 1758. 1741 cm'' (C=0), 1314. 1133 cm"' (S-SO,); 'HNMR(500 MHz, 
CDC1 3 , COSY) 5 1.99. 2.02, 2.06, 2.08 (s x 4, 3H x 4. Ac x 4), 3.30-3.38 (m, 2H, -CH 2 S- 
), 3.34 (s, 3H, CH 3 S0 2 -). 3.70 (ddd, J 4 . 5 9.9 Hz, J iA 2.2 Hz, J 5A - 4.6 Hz, 1H, H-5), 3.83 
(ddd, J 5.6 Hz, J 7 A Hz. J 10.5 Hz, 1H, -OCHH"-), 4.13-4.18 (m, 2H, H-6, -OCHH». 
4.24 (dd, J 56 . 4.6 Hz, J 6b 12.4 Hz. 1H, H-6'), 4.55 (d. J, , 8.1 Hz, 1H, H-l), 4.98 (dd, J, , 

8.1 Hz, J 22 9.7 Hz, 1H. H-2), 5.07 (t, J9.9 Hz, 1H, H-4), 5.19 (t, J9.6 Hz, 1H, H-3); ,3 C 
NMR (125 MHz, CDC1 3 . DEPT) 5 20.5, 20.7 (q x 2, CH 3 COO- x 4), 36.0, (t, -CH 2 S-), 

50.6 (q, CH 3 SCy), 61.6 (t, -OCH,-), 68.1, 70.8, 71.9, 72.5 (d x 4, C-2, C-3, C-4, C-5), 

68.4 (t, C-6), 100.8 (d, C-l), 169.3, 170.0, 170.5 (s x 3, CH 3 COO- x 4); HRMS m/z 

(FAB+): Found 487.0940 (M+H"); C 17 H 27 0 12 S 2 requires 487.0944. 

Preparation of 2-($-D-Glncopyranosyl)ethyl methanethiosulfonate (lc). 

A solution of NaOMe (0.1M, 0.3 mL) was added to a suspension of 4h 
(See Figure 6) (300 mg, 0.66 mmol) in MeOK (3 mL) under N 2 and stirred vigorously. 
After 4 hours, the resulting solution was passed through a Dowex 50W(H") plug (2 x 1 
cm, eluant MeOH) and the solvent removed to give 2-bromoethyl (3-D-glucopyranoside 4c 
(See Figure 6) (176 mg. 93%) as a white solid that was used directly in the next step. A 
sample was recrystallized from EtOH/EtOAc to give a colorless, crystalline solid; mp 74- 
78°C (EtOH/EtOAc) [lit.. (Helferich et al., Just. Lieb. Ann. Chem. , 541:1-16 (1939), 
which is hereby incorporated by reference) mp 74-75°C (EtOH/EtOAc)]; [a] 26 D = - 22.4 
(c 1.63, H 2 0) [lit., (Helferich et al., Just. Lieb. Ann. Chem. , 541:1-16 (1939), which is 
hereby incorporated by reference) [a]' 9 D = - 26.1 (c 3.0, H 2 0)]; ] H NMR (400 MHz, 
CD 3 OD) 5 3.30 (t, J 8.4 Hz, 1H, H-2), 3.39-3.49 (m, 3H), 3.64-3.80 (m, 3H), 3.97 (br d, 
J 6A , 1 1.7 Hz, 1H, H-6'), 4.02 (dt, J t 6.5 Hz, J, 11.3 Hz, 1H, -OCHH'-), 4.23 (dt, J x 6.5 Hz, 
J d 1 1.3 Hz, 1H, -OCHH», 4.44 (d, J u2 7.9 Hz, 1H, H-l). NaSS0 2 CH 3 (100 mg, 0.75 
mmol) was added to a solution of 4c (See Figure 6) (176 mg, 0.61 mmol) in DMF (7 mL) 
under N 2 and warmed to 50°C. After 15 hours, the solution was cooled and the solvent 
removed. The residue was purified by flash chromatography (MeOH : EtOAc, 1:9) to 
give lc (See Figure 6) (144 mg, 74%) as a hygroscopic foam; [a] 27 D = - 15.8 (c 0.88, 
H 2 0); IR (KBr) 3400 cm" 1 (OH), 1310, 1 131 cm" 1 (S-S0 2 ); 'H NMR (500 MHz, D 2 0, 
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COSY) 8 3.07 (dd, J, 2 8.1 Hz, J 23 9.4 Hz, 1H, H-2), 3.16 (dd, J, 4 9.0 Hz, J 45 9.8 Hz, 1H, 
H-4), 3.24 (ddd, J 4 . 5 9.8 Hz, J 16 6.0 Hz, J 56 - 2.3 Hz, 1H, H-5), 3.27 (t, 79.0 Hz, 1H, H-3), 
3.30-3.33 (m, 2H, -CH 2 S-), 3.34 (s, 3H, CH 3 S0 2 -), 3.50 (dd, J 55 6.0 Hz, J u , 12.4 Hz, 1H, 
H-6),3.69 (dd,J 56 .2.3 Hz,J 6 . 6 - 12.4 Hz, 1H, H-6'), 3.81 (dt, J x 5.8 Hz, J d 1 1.5 Hz, 1H,- 
OCHH'-), 4.00 (dt, J t 5.7 Hz, J d 11.4 Hz, 1H, -OCHH», 4.30 (d. J, 2 8.1 Hz, 1H, H-l); 
n C NMR (50 MHz, D 2 0) 5 36.9 (-CH,S-), 51.0 (CH 3 SO r ), 62.0 (-OCH,-), 69.5, 70.9, 
74.3, 76.7, 77.3 (C-2, C-3. C-4, C-5, C-6), 103.7 (C-l): HRMS m/z (FAB+): Found 
341.0351 (M+Na + ); C 9 H ls O g S 2 Na requires 341.0341. 

Preparation of 2-(2, 3, 4, 6-Tetra-0-acetyl-a-D-mannopyranosyl)ethyl 
methanethiosulfonate (li). 

BF 3 .Et 2 0 (7.7 mL, 60.7 mmol) was added dropwise over the course of 15 
minutes to a solution of 1,2,3 ,4,6-penta-0-acetyl-a,P-D-mannose (5d) (See Figure 7) (4.7 
g, 12.1 mmol) and Br(CH 2 ) 2 OH (1.05 mL, 14.8 mmol) in CH,C1, (22 mL) at 0°C under 
N 2 . After 1 hour, the solution was warmed to room temperature. After 25 hours, the 
reaction solution was added to ice water (20 mL) and extracted with CH 2 C1 2 (20 mL x 2). 
These extracts were combined, washed with water (20 mL), sat. NaHC0 3 (aq., 20 mL), 
water (20 mL), dried (MgS0 4 ), filtered, and the solvent removed. The residue was 
crystallized from EtOAc/wo-octane to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-a-D- 
mannopyranoside (4i) (See Figure 7) (3.52 g, 64%). Purification of the resulting mother 
liquor by flash chromatography (EtOAc:hexane, 1 :3) gave further 4i (See Figure 7) (320 
mg, 6%; 70% in total) as a white highly crystalline solid; mp 121-123°C [lit., (Dahmen et 
al., "2-Bromoethyl Glycosides - Synthesis and Characterization," Carbohvdr. Res. , 
1 16:303-307 (1983), which is hereby incorporated by reference) 118-119°C (EtOAc/wo- 
octane)]; [a] 28 D = + 48.3 (c 1.31, CHC1 3 ) [lit., (Dahmen et al., "2-Bromoethyl Glycosides 
- Synthesis and Characterization," Carbohvdr. Res. . 1 16:303-307 (1983), which is hereby 
incorporated by reference) [a] 23 D = + 45 (c 0.6, CDC1 3 )]; 'H NMR (200 MHz, CDC1 3 ) 5 
1.99, 2.05, 2.10, 2.16 (s x 4, 3H x 4, Ac x 4), 3.52 (t, J 6 Hz, 2H, -CH 2 Br), 3.82-4.04 (m, 
2H, -OCH 2 -), 4.09-4.16 (m, 1H, H-5), 4.13 (dd, J sfi 2 Hz, J M . 12 Hz, 1H, H-6) 4.-28 (dd, 
J 5 6 . 6 Hz, J 6 . 6 - 12 Hz, 1H, H-6'), 4.87 (br s, 1H, H-l), 5.22-5.40 (m, 3H, H-2, H-3, H-4). 
NaSS0 2 CH 3 (230 mg, 1.72 mmol) was added to a solution of 4i (See Figure 7) (600 mg, 
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1.32 mmol) in DMF (17 mL) under N 2 and warmed to 55°C. After 20 hours, the solution 
was cooled and the solvent removed. The residue was purified by flash chromatography 
(EtOAc : hexane, 9:1 1) and the resulting solid recrystallized from Et,0/hexane to give li 
(See Figure 7) (566 mg, 88%) as a white solid; mp 128-129°C (Et 2 0/hexane); [cc] 27 D = + 
53.2 (c 0.92, CHCLj); IR(KBr) 1739 cm" 1 (C=0). 1325. 1 129 cm"' (S-S0 2 ); 'HNMR(500 
MHz, CDC1 3 , COSY) 5 1.97, 2.04, 2.09, 2.14 (s x 4. 3H x 4, Ac x 4), 3.37-3.40 (m, 2H, - 
CH 2 S-), 3.38 (s, 3H, CH 5 S0 2 -), 3.79 (dt, J d 10.5 Hz. J t 5.8 Hz, 1H, -OCHH'-), 3.98-4.03 
(m, 2H, -OCHH1-, H-5), 4.09 (dd, J 56 2.5 Hz, J 65 - 12.5 Hz, 1H, H-6), 4.26 (dd, J 56 - 5.6 
Hz, J 66 . 12.5 Hz, 1H, H-6'), 4.85 (d, J l 2 0.7 Hz, 1H, H-l), 5.23-5.29 (m, 3H, H-2, H-3, H- 
4); ,3 C NMR (50 MHz, CDCUJ 5 20.6, 20.7, 20.8 (CH 3 COO- x 4), 35.7, (-CH,S-), 50.8 
(CH 3 S0 2 -), 62.5 (-OCH r ), 66.0, 66.8, 69.0, 69.2, 69.3 (C-2, C-3, C-4, C-5, C-6), 97.7 (C- 
1), 169.7, 169.9, 170.0, 170.6 (CH 3 COO- x 4); HRMS m/z (FAB+): Found 487.0954 
(M+H + ); C 17 H 27 0 12 S 2 requires 487.0944. 

Preparation of 2-(<x-D-Mannopyranosyl) ethyl methanethiosulfonate (Id). 

A solution of NaOMe (0.143 M, 0.7 mL) was added to a suspension of 4i 
(See Figure 7) (1 g, 2.2 mmol) in MeOH (10 mL) under N,. After 3 hours, the resulting 
solution was passed through a Dowex 50W(H + ) plug (2x1 cm, eluant MeOH) and the 
solvent removed. The residue was purified by flash chromatography (MeOH : EtOAc, 
2:25) to give 2-bromoethyl a-D-mannopyranoside 4d (See Figure 7) (The use of 4d as a 
reactant has been described previously, although no details of preparation or 
characterization were given. (U.S. Patent 4 918 009 to Nilsson, which is hereby 
incorporated by reference)) (606 mg, 96%) as a white foam; [a] 26 D = + 50.7 (c 0.91, H 2 0); 
IR (KBr) 3417 cm" 1 (OH); 'H NMR (500 MHz, D,0, COSY) 5 3.38-3.44 (m, 3H, H-4, - 
CH 2 Br), 3.50-3.55 (m, 2H, H-5, H-6), 3.60 (dd, J 23 3.5 Hz, J 3A 9.7 Hz, 1H, H-3), 3.66 (dd, 
J Sj6 . 4.6 Hz, J 6A , 1 1 .2 Hz, 1H, H-6'), 3.68 (ddd, J 4.6 Hz, J 5.4 Hz, J 1 1.7 Hz, 1H, - 
OCHH'-), 3.76(dd,y L2 1.8 Hz, J 23 3.5 Hz, 1H, H-2). 3.81 (ddd, J 5.1 Hz, J6.5 Hz, J 
11.7, 1H, -OCHH1-), 4.71 (d,J :>2 1.8 Hz, 1H, H-l); l3 C NMR (100 MHz, D 2 0)5 32.1 (- 
CH 2 Br), 61.7 (-OCH,-), 67.5, 68.4, 70.7, 71.3, 73.8 (C-2, C-3, C-4, C-5, C-6), 100.5 (C- 
1); HRMS m/z (FAB+): Found 308.9985 (M+Na + ); C g H, 5 0 6 79 BrNa requires 308.9950. 
NaSSO,CH 3 (150 mg, 1.12 mmol) was added to a solution of 4d (See Figure 7) (245 mg, 
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0.85 mmol) in DMF ( 1 0 mL) under N 2 and warmed to 50°C. After 16 hours, the solution 
was cooled and the soh ent removed. The residue was purified by flash chromatography 
(MeOH : EtOAc, 1:9) to give Id (See Figure 7) (217 mg, 80%) as a hygroscopic foam; 
[a] 29 D = + 58.0 (c 1.34, FLO); IR (KBr) 3441 cm" 1 (OH), 1314, 1132 cm" 1 (S-S0 2 ); 'H 
NMR (500 MHz, D 2 0) 5 3.3 1 (t, J 5.8 Hz, 2H, -CH 2 S-), 3.35 (s. 3H, CH 3 SO r ), 3.45 (t, J 
9.6 Hz, 1H, H-4), 3.49 (ddd. J A 5 9.8 Hz, J 5 . 6 5.8 Hz, J 5A . 1.9 Hz. 1H, H-5), 3.55 (dd, J 5 . 6 5.8 
Hz, J 66 - 12.1 Hz, 1H, H-6). 3.60 (dd, J 2J 3.4 Hz, J 3A 9.0 Hz, 1H, H-3), 3.66 (dt, J A 10.7 Hz, 
J, 5.7 Hz, 1H, -OCHH'-). 3.69 (dd, J u . 1.9 Hz, J M - 12.1 Hz, 1H, H-6'), 3.77 (dd, J,. 2 1.6 
Hz, J 2 . 3 3.4 Hz, 1H, H-2). 3.83 (dt, J d 1 1 .0 Hz, J t 5.9 Hz, 1H, -OCHH», 4.72 (d, J L2 1.6 
Hz, 1H, H-l); 13 C NMR (125 MHz, D 2 0) 8 36.7 (-CH 2 S-), 50.7 (CH 3 S0 2 -), 61.9 (-OCH 2 - 
), 66.7, 67.7, 70.9, 71.5, 74.0 (C-2, C-3, C-4, C-5, C-6), 100.8 (C-l); HRMS mfz (FAB+): 
Found 319.0528 (M+H~); C 9 H 19 O g S 2 requires 319.0521. 

Preparation of 2-(2,3, 4, 6-Tetra-0-acetyl-$-D-galactopyranosyl)ethyl 
methanethiosulfonate (lj). 

BF 3 .Et 2 0 (8.5 mL, 67.0 mmol) was added dropvvise to a solution of of 
l,2,3,4,6-penta-0-acetyl-a,p-D-galactose (5e) (See Figure 7) (5.1 g, 13.1 mmol) and 
Br(CH 2 ) 2 OH (1.15 mL, 16.2 mmol) in CH 2 C1 2 (24 mL) at 0°C under N 2 . After 1 hour, the 
solution was warmed to room temperature. After 24 hours, the reaction solution was 
added to ice water (20 mL) and extracted with CH 2 C1 2 (30 mL x 3). These extracts were 
combined, washed with water (20 mL), sat. NaHC0 3 (aq., 20 mL), water (20 mL), dried 
(MgS0 4 ), filt Vogtle ered, and the solvent removed. The residue purified by flash 
chromatography (EtOAc : hexane, 1 : 3) to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-p-D- 
galactopyranoside (4j) (See Figure 7) (4.01 g, 67%) as a white solid; mp 1 16-1 17°C 
(EtOAc/hexane) [lit., (Coles et al., J. Am. Chem. Soc , 60:1020-1022 (1938), which is 
hereby incorporated by reference) 11 1°C; lit., (Dahmen et al., "2-Bromoethyl Glycosides 
- Synthesis and Characterization," Carbohvdr. Res. . 1 16:303-307 (1983), which is hereby 
incorporated by reference) 1 14-1 16°C (EtOAc/light pet. ether)]; [cc] 27 D = - 3.8 (c 0.81, 
CHC1 3 ) [lit., (Dahmen et al., "2-Bromoethyl Glycosides - Synthesis and 
Characterization," Carbohvdr. Res. , 1 16:303-307 (1983), which is hereby incorporated by 
reference) [cc] 23 D = - 5 (c 1 .4, CDC1 3 )]; 'H NMR (200 MHz, CDC1-) 8 1 .98, 2.05, 2.08, 
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2.15 (s x 4. 3H x 4, Ac x 4). 3.43-3.50 (m, 2H), 3.75-3.95 (m, 2H), 4.12-4.24 (m, 3H), 
4.53 (d, J, 2 8 Hz, 1H, H-l),5.02 (dd,J 2 . 3 11 Hz, J 3 . 4 3 Hz, 1H, H-3), 5.23 (dd, J, 2 8 Hz, 
J, 3 1 1 Hz, 1H, H-2), 5.40 (br d, J 3A 3 Hz, 1H, H-4). NaSS0 2 CH 3 (85 mg, 0.63 mmol) was 
added to a solution of 4j (See Figure 7) (223 mg, 0.49 mmol) in DMF (6 mL) under N 2 
5 and warmed to 55°C. After 30 hours, the solution was cooled and the solvent removed. 
The residue was purified by flash chromatography (EtOAc : hexane, 1 :1) to give lj (See 
Figure 7) (198 mg, 83%) as a white foam; [a] 27 D = + 9.1 (c 1.41.CHC1 3 ); IR (film) 1747 
cm' 1 (C=0). 1320, 1 133 cm" 1 (S-SO,); 'H NMR (500 MHz, CDC1 3 ) 8 1.98, 2.05, 2.09, 
2.15 (s x 4, 3H x 4, Ac x 4), 3.35 (s, 3H, CH,S0 2 -), 3.35-3.38 (m, 2H, -CH,S-), 3.84 
0 (ddd, J 6.1 Hz, J 7.1 Hz. J 10.5 Hz, 1H, -OCHH'-), 3.92 (td, J, 5 1.1 Hz, J t 6.6 Hz, 1H, H- 
5), 4.10-4.21 (m, 3H, H-6, H-6\ -OCHH», 4.52 (d, J X2 8.0 Hz, 1H, H-l), 5.01 (dd, J 2 . 3 
10.3 Hz, J JA 3.5 Hz, 1H. H-3), 5.20 (dd, J y2 8.0 Hz, J 23 10.3 Hz, 1H, H-2), 5.40 (dd, J 3 . 4 
3.5 Hz, J 4 . 3 1.1 Hz, 1H, H-4); 13 C NMR (100 MHz, CDC1 3 ) 5 20.6, 20.7, 20.8 (CH 3 COO- 
x 4), 36.1, (-CH 2 S-), 50.6 (CH 3 S0 2 -), 61.2 (-OCH 2 -), 67.0, 68.3, 68.5, 70.8, 71.0 (C-2, C- 
15 3, C-4, C-5, C-6), 101.3 (C-l), 169.5, 170.0, 170.1, 170.4 (CH 3 COO- x 4); HRMS m/z 
(FAB+): Found 487.0936 (M+H + ); C, 7 H 27 O l2 S 2 requires 487.0944. 

Preparation of 2-($-D-Galactopyranosyl) ethyl methanethiosulfonate (le). 

A solution of NaOMe (0.104 M, 0.8 mL) was added to a solution of 4j 
20 (See Figure 7) (778 mg. 1 .71 mmol) in MeOH (10 mL) under N 2 . After 4 hours, the 
reaction solution was passed through a Dowex 50W(H + ) plug (3 x 1 cm, eluant MeOH) 
and the solvent removed to give 2-bromoethyl (3-D-galactopyranoside (4e) (See Figure 7) 
(450 mg, 92%) (The synthesis of unstable 4e has been described previously. Dahmen et 
al., "2-Bromoethyl Glycosides .4.2-Bromoethyl Glycosides in Glycoside Synthesis - 
25 Preparation of Glycoproteins Containing Alpha-L-Fuc-( 1 ->2)-D-Gal and Beta-D-Gal-( 1 - 
>4)-D-Glcnac," Carbohvdr. Res. . 125:237-245 (1984), which is hereby incorporated by 
reference) as a white solid which was used directly in the next step. NaSS0 2 CH 3 (180 
mg, 1 .34 mmol) was added to a solution 4e (See Figure 7) (290 mg, 1 .01 mmol) in DMF 
(12 mL) under N 2 and warmed to 50°C. After 15 hours, the solution was cooled and the 
30 solvent removed. The residue was purified by flash chromatography (MeOH : EtOAc, 
1 :9) to give le (See Figure 7) (229 mg, 71%) as a white foam; [ct] 27 D = + 2.9 (c 0.58, 
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H 2 0); IR (film) 3358 cm"' (br, O-H), 1306, 1 120 cm" 1 (S-S0 2 ); 'H NMR (500 MHz, D,0, 
COSY) § 3.29-3.33 (m, 2H. -CH 2 S-), 3.30 (dd, J, 2 7.7 Hz, J 23 10.0 Hz, 1H, H-2), 3.35 (s, 
3H, CHjS0 2 -), 3.43 (dd, J 2J 10.0 Hz, J 3A 3.6 Hz, 1H, H-3), 3.48 (ddd, J 4J 0.9 Hz, J 56 4.3 
Hz, J 56 - 7.9 Hz, 1H, H-5). 3.52 (dd, J 56 4.3 Hz, J 66 - 1 1.7 Hz, 1H, H-6), 3.57 (dd, J 5A - 7.9 
5 Hz, J 66 - 11.7 Hz, lH,H-6-),3.70(dd.J 3J 3.6Hz,J 43 0.9 Hz, 1H, H-4), 3.80 (dt, J d 11.2 
Hz, J t 6.1 Hz, 1H, -OCHTT-), 4.01 (dt, J d 11.4 Hz, J, 5.8 Hz, 1H, -OCHH», 4.24 (d. J { 2 
7.7 Hz, 1H, H-l); ''C NMR (100 MHz, D,0) 5 36.7 (-CH 2 S-), 50.8 (CH 3 S0 2 -), 61.9 (- 
OCH r ), 69.2, 69.6, 71.7, 73.7, 76.2 (C-2, C-3, C-4, C-5, C-6), 104.0 (C-l); HRMS m/z 
(FAB+): Found 319.0523 (M+H + ); C 9 H, 9 O s S, requires 319.0521. 

10 

Preparation of 2-(2, 3, 6-Tri-0-acetyl-4-0-(2. 3, 4, 6-tetra-0-acetyl-$-D-galactopyranosyl)- 
$-D-glucopyranosyl) ethyl methanethiosulfonate (lk). 

BF 3 .Et 2 0 (4.0 mL, 3 1 .5 mmol) was added dropwise to a solution of 
1 5 1 ,2,3 ,6-tetra-0-acetyl-4-0-(2,3,4,6-tetra-0-acetyl-p-D-galactopyranosyl)-(3-D- 

glucopyranoside (5f) (See Figure 7) (5 g, 7.4 mmol) and Br(CH 2 ) 2 OH (0.65 mL, 9.2 
mmol) in CH 2 C1 2 (15 mL) at 0°C under N 2 . After 1 hour, the solution was warmed to 
room temperature. After 20 hours, the reaction solution was added to ice water (15 mL) 
and extracted with CH 2 C1 2 (20 mL x 2). These extracts were combined, washed with 
20 water (20 mL), sat. NaHC0 3 (aq., 20 mL), water (20 mL), dried (MgS0 4 ), filtered, and the 
solvent removed. The residue was purified by flash chromatography (EtOAc : hexane, 1 : 
1) to give 2-bromoethyl 2.3,6-tri-0-acetyl-4-0-(2,3,4,6-tetra-0-acetyl-(3-D- 
galactopyranosyl)-P-D-glucopyranoside (4k) (See Figure 7) (2.94 g, 53%) as a white 
foam; [a] 27 D = - 7.8 (c 1.28, CHC1 3 ) [lit., (Dahmen et al., "2-Bromoethyl Glycosides - 
25 Synthesis and Characterization," Carbohvdr. Res. , 1 16:303-307 (1983), which is hereby 
incorporated by reference) [a] 23 D = - 11 (c 1.3, CHC1 3 )]; 'H NMR (500 MHz, CDC1 3 , 
COSY) 5 1.94, 2.02, 2.02 (s x 3, 3H x 3, Ac x 3), 2.04 (s, 6H, Ac x 2), 2.10, 2.13 (s x 2, 
3H x 2, Ac x 2), 3.38-3.46 (m, 2H, -CH 2 Br), 3.59 (ddd, J 5 . 6 „- 2.2 Hz, J4.9 Hz, J9.9 Hz, 
1H, H-5'), 3.75-3.80 (m, 2H, H-4', -OCHH'-), 3.85 (td,J 45 1.1 Hz, J, 6.9 Hz, 1H, H-5), 
■ 30 4.03-4.12 (m, 4H, H-6, H-6', H-6", -OCHH», 4.45 (d, J 1>2 7.8 Hz, 1H, H-l), 4.48 (dd, 

J 5 , 6 - 2.2 Hz, J 6 ., 6 ... 12.1 Hz, 1H, H-6'"), 4.50 (d, J vx 7.9 Hz, 1H, H-l'), 4.89 (dd, J„ r 7.9 
Hz, J rj - 9.6 Hz, 1H, H-2'), 4.92 (dd, J 23 10.5 Hz, J 3A 3.4 Hz, 1H, H-3), 5.08 (dd, J,. 2 7.8 
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Hz, J 2J 10.5 Hz, 1H, H-2), 5.18 (t, J 9.6Hz, 1H, H-3"). 5.32 (dd, J 3 . 4 3.4 Hz, J 4 5 1.1 Hz, 
1H, H-4). NaSS0 2 CH, (87, mg, 0.65 mmol) was added to a solution of 4k (See Figure 7) 
(357 mg, 0.48 mmol) in DMF (6 mL) under N 2 and warmed to 50 °C. After 22 hours, the 
solution was cooled and the solvent removed. The residue was purified by flash 
5 chromatography (EtOAc : hexane, 1 1 :9) to give lk (See Figure 7) (327 mg, 88%) as a 
white foam; [a] 27 D = - 3.7 (c 1.0, CHC1 3 ); IR (KBr) 175 1 cm"' (C=0), 1323, 1 134 cm' 1 (S- 
S0 2 ); 'H NMR (500 MHz, CDC1 3 , COSY) 5 1.94, 2.02, 2.02, 2.04, 2.04, 2.1 1, 2.13 (s x 7, 
3H x 7, Ac x 7), 3.29-3.40 (m, 2H, -CH 2 S-), 3.32 (s. 3H. CH 3 S0 2 -). 3.59 (ddd, J rs 9.9 
Hz, J y _ 6 - 4.9 Hz, J fJsr . 2.2 Hz, 1H, H-5'), 3.77 (t, J9.5 Hz. 1H, H-4'), 3.79-3.86 (m, 2H, 
10 H-5, -OCHH'-), 4.03-4.13 (m, 4H, H-6, H-6', H-6". -OCHFP-), 4.46 (d, J ( 2 7.8 Hz, 1H, 
H-l),4.50(d,J,, 2 8.0Hz, lH,H-l'),4.52(dd,J 5 . 6 . 2.2 Hz,J 6 ~. 6 - 11.9 Hz, lH,H-6" ! ), 
4.87 (dd, J vx 8.0 Hz, J Ty 9.6 Hz, 1H, H-2'), 4.93 (dd, J 2 3 10.5 Hz, J 34 3.5 Hz, 1H, H-3), 
5.08 (dd, J 1>2 7.8 Hz, J u 10.5 Hz, 1H, H-2), 5.17 (t, 7 9.3Hz, 1H, H-3'), 5.32 (dd, J 3 . 4 3.5 
Hz, J 4 5 1.0 Hz, 1H, H-4); !3 C NMR (125 MHz, CDC1 3 , DEPT) 5 20.5, 20.7, 20.8, 20.9 (q 
15 x 4, CH 3 COO- x 7), 36.0 (t, -CH 2 S-), 50.6 (q, CH 3 S0 2 -), 60.7, 61.6, 68.5 (t x 3, -OCH,-, 
C-6, C-6' ), 66.5, 69.0, 70.6, 70.9, 71.3, 72.6, 72.8, 76.0 (d x 8, C-2, C-3, C-4, C-5, C-2\ 
C-3\ C-4', C-5'), 100.7, 101.1 (d x 2, C-l, C-l'), 169.1, 169.7, 169.7, 170.1, 170.2, 
170.3, 170.4 (s x 7, CHjCOO- x 7); HRMS m/z (FAB+): Found 775.1793 (M+H + );' 
C 29 H 43 O 20 S 2 requires 775.1789. 

20 

Preparation of2-(4-0-$-D-Galactopyranosyl-$-D-glucopyranosyl)ethyl 
me thane thiosulfonate (If). 

A solution of NaOMe (0.1 M, 0.6 mL) was added to a solution of 4k (See 
25 Figure 7) (877 mg, 1.18 mmol) in MeOH (6 mL) under N 2 . After 3 hours, the reaction 
solution was passed through a Dowex 50W(H T ) plug (4 x 1 cm, eluant MeOH) and the 
solvent removed to give 2-bromoethyl 4-<9-P-D-galactopyranosyl-f3-D-glucopyranoside 
(4f) (See Figure 7) (476 mg, 90%) as a white foam which was used directly in the next 
step. NaSSO,CH 3 (185 mg, 1.38 mmol) was added to a solution of 4f (See Figure 7) (476 
30 mg, 1.06 mmol) in DMF (24 mL) under N 2 and warmed to 50°C. After 21 hours, the 
solution was cooled and the solvent removed. The residue was purified by flash 
chromatography (CHC1 3 : MeOH : AcOH : H,0, 60:30:3:5) to give If (See Figure 7) (346 



f 



mg, 68%) as a hygroscopic foam; [a] 28 D = + 1.5 (c 1.66, H 2 0); IR (KBr) 3416 cm" 1 (br, 
OH), 1311, 1131 cm" 1 (S-S0 2 ); 'HNMR (400 MHz, D 2 0 ? COSY) 5 3.10-3.13 (m, 1H, H- 
2), 3.30 (t, J 6.0 Hz, 2H -CH 2 S-). 3.31 (dd, J V2 7.8 Hz, J Y3 10.3 Hz, 1H, H-2'), 3.34 (s, 
3H, CH 3 SO r ), 3.38-3.54 (m, 5H), 3.44 (dd, J r y 10.3 Hz, J rA . 3.3 Hz, 1H, H-3'), 3.57 

5 (dd, J 8.4 Hz, J 11.4 Hz, 1H), 3.59 (dd, J 4.9 Hz, J 7.3 Hz, 1H), 3.70 (brd,J 3 ^ 3.3 Hz, 
1H, H-4'), 3.77 (dd, J 1.0 Hz, J 1 1.5 Hz, 1H), 3.79-3.83 (m, 1H, -OCHH'-), 3.97-4.02 (m, 
1H, -OCHH», 4.22 (d, J { , 7.8 Hz, 1H, H-l'), 4.33 (d, J L2 7.8 Hz, 1H, H-l); l3 C NMR 
(125 MHz, D 2 0) 5 36.7 (-CH.S-), 50.8 (CH 3 SO r ), 61.0, 62.1, 69.4, 69.6, 71.9, 73.5, 73.7, 
75.3, 75.9, 76.4, 79.3 (-OCH 2 -, C-2, C-3, C-4, C-5, C-6, C-2\ C-3\ C-4 7 , C-5\ C-6'), 

10 103.3, 103.9 (C-l, C-l'); HRMS m/z (FAB+): Found 503.0886 (M+Na^); C 15 H 28 0 13 S 2 Na 
requires 503.0869. 

Example 2 - General Procedure for Modification of Subtilisin Bacillus lentus 
("SBL") Mutants Stored as Flash-Frozen Solutions 

15 

A 1 .25 mL frozen aliquot of the mutant enzyme (SBL-N62C, -L217C, or 
-S166C) containing approximately 25 mg of enzyme was thawed and added to 1 .25 mL of 
Modifying Buffer (see below) in a polypropylene test-tube. To this solution was added 
100 jiL of a 0.2 M glyco-MTS reagent solution (la,g-k in MeCN, Ib-f in water (See 

20 Figure 8)). The mixture was sealed, vortexed, and placed on an end-over-end rotator at 
room temperature. When the modification was complete (determined by a specific 
activity assay, using succinyl-AlaAlaProPhe-j^-nitroanilide [e 4]0 = 8800 M* 1 cm" 1 ] 
(Bonneau et aL, "Alteration of the Specificity of Subtilisin BPN' by Site-Directed 
Mutagenesis in its SI and SI' Binding-Sites," J. Am. Chem. Soc, 1 19:1026-1030 (1991), 

25 which is hereby incorporated by reference) as substrate in 0.1 M Tris-HCl buffer 

containing 0.005% Tween 80, 1% DMSO, pH 8.6 showing constant activity and titration 
with Ellman's reagent (s 412 = 13600 M" 1 cm" 1 ) (Ellman et aL, Biochem. Pharmacol. , 7:88- 
95 (1961), which is hereby incorporated by reference) showing no free thiol present in 
solution), a further 50 j-iL of the modifying reagent solution was added and the mixture 

30 placed back on the end-over-end rotator for a further 10 minutes. The reaction was 

poured onto a pre-packed, pre-equilibrated G-25 Sephadex® PD10 column and eluted with 
3.5 mL Quench Buffer (see below). The eluant was dialysed at 4°C against 10 mM MES, 
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1 mM CaCl, pH 5.8 (2 x 1L, 2 x 45 minutes). The resulting dialysate was flash frozen in 
liquid nitrogen and stored at-18°C. 



Modifying Buffer: 



pH 9.5 
pH 7.5 
pH 6.5 
pH5.5 



140 mM CHES, 2 mM CaCl, 
140 mM HEPES. 2 mM CaCl 2 
140 mM MES, 2 mM CaCl, 
140 mM MES, 2 mM CaCl, 



Quench Buffer: Reactions at pH 7.5 - 9.5: 5 mM MES 1 mM CaCl, pH 6.5 
10 Reactions at pH 5.5: 5 mM MES 1 mM CaCl 2 pH 5.5 



The free thiol content of all chemically modified mutant enzymes ("CMMs"), was 
determined spectrophotometrically by titration with Ellman's reagent (Ellman et al., 
Biochem. Pharmacol. , 7:88-95 (1961), which is hereby incorporated by reference) in 

15 phosphate buffer 0.25 M, pH 8.0. In all cases, no free thiol was detected. Modified 

enzymes were analyzed by nondenaturing gradient (8-25%) gels at pH 4.2, run towards 
the cathode, on the Pharmacia Phast-system and appeared as a single band. Prior to ES- 
MS analysis, CMMs were purified by FPLC (BioRad, Biologic System, Hercules, CA) on 
a Source 15 RPC matrix (17-0727-20 from Pharmacia, Bridgewater, NJ) with 5% 

20 acetonitrile, 0.01% TFA as the running buffer and eluted with 80% acetonitrile, 0.01% 
TFA in a one step gradient. MS m/z (ES-MS): N62C-S-a (See Figure 8) calculated 
27049, found 27051; N62C-S-b (See Figure 8) calculated 26925, found 26928; N62C-S-C 
(See Figure 8) calculated 26925, found 26928; N62C-S-d (See Figure 8) calculated 
26925, found 26925; N62C-S-e (See Figure 8) calculated 26925, found 26925; N62C-S-f 

25 (See Figure 8) calculated 27087, found 27087; N62C-S-g (See Figure 8) calculated 

27093, found 27096; N62C-S-Et-fi-Glc(Ac), calculated 27009, found 27015; N62C-S-Et- 
(3-Glc(Ac) 3 calculated 27051, found 27053; N62C-S-i (See Figure 8) calculated 27093, 
found 27098; N62C-S-Et-(3-Gal(Ac) 3 calculated 2705 1, found 27051; N62C-S-k (See 
Figure 8) calculated 27381, found 27386; L217C-S-f5-Glc calculated 26882, found 26879; 

30 L217C-S-p-Glc(Ac), calculated 26966, found 26962; L2 1 7C-S-P-Glc(Ac) 3 calculated 
27008, found 27006; L217C-S-b (See Figure 8) calculated 26926, found 26928; L217C- 
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S-c (See Figure 8) calculated 26926, found 26925; L217C-S-d (See Figure 8) calculated 
26926, found 26925; L217C-S-e (See Figure 8) calculated 26926, found 26928; L217C- 
S-f (See Figure 8) calculated 27088, found 27087; L217C-S-Et-a-Glc(Ac) 2 calculated 
27010, found 27012; L217C-S-Et-p-Glc(Ac) 3 calculated 27052, found 27056; L217C-S- 
Et-a-Man(Ac), calculated 27052, found 27056; L217C-S-Et-P-Gal(Ac) 3 calculated 
27052, found 27053; L217C-S-Et-Lac(Ac) 6 calculated 27340, found 27342; S166C-S-a 
(See Figure 8) calculated 27076, found 27080; S166C-S-b (See Figure 8) calculated 
26952, found 26955; S166C-S-C (See Figure 8) calculated 26952, found 26950; S166C-S- 
d (See Figure 8) calculated 26952, found 26952; S166C-S-e (See Figure 8) calculated 
26952, found 26952; S166C-S-f (See Figure 8) calculated 271 14, found 271 12; S166C-S- 
Et-a-Glc(Ac) 3 calculated 27078, found 27078; S166C-S-Et-p-Glc(Ac) 2 calculated 27036, 
found 27040; S166C-S-Et-p-Glc(Ac) 3 (major) with S166C-S-h (See Figure 8) (minor) 
and S166C-S-Et-p-Glc(Ac) 2 (minor) calculated 27078 (major), 27120 (minor), 27036 
(minor), found 27081 (major), 27121 (minor), 27036 (minor); S 1 66C-S-Et-a-Man(Ac) 3 
calculated 27078, found 27085; S166C-S-Et-p-Gal(Ac) 3 calculated 27078, found 27079; 
S166C-S-Et-Lac(Ac) 5 calculated 27324, found 27331. 

Example 3 - General procedure for modification of SBL mutants stored as 
lyophilized powders 

This procedure was only used with S156C, which is stored as a lyophilized 
powder to prevent dimerization. Into a polypropylene test tube was weighed about 25 - 
30 mg of lyophilized S156C. This was dissolved in the following modifying buffers (2.5 
mL): 

pH9.5: 70 mM CHES, 2 mM CaCl, 
pH 7.5 : 70 mM HEPES, 2 mM CaCl 2 
pH6.5: 70 mM MES, 2 mM CaCl, 
pH 5 .5 : 70 mM MES, 2 mM CaCl, 

Glyco-MTS reagent was added and the reaction then proceeded as for the other mutants, 
using the appropriate quench buffer. MS m/z (ES-MS): S156C-S-a (See Figure 8) 
calculated 27076, found 27079; S156C-S-b (See Figure 8) calculated 26952, found 



26955; S156C-S-C (See Figure 8) calculated 26952, found 26952; S156C-S-d (See Figure 
8) calculated 26952, found 26952; S156C-S-e (See Figure 8) calculated 26952, found 
26952; S156C-S-f (See Figure 8) calculated 27114, found 271 15; S156C-S-g (See Figure 
8) calculated 27120, found 27123; S156C-S-h (See Figure 8) calculated 27120, found 
27122; S156C-S-i (See Figure 8) calculated 27120, found 27123; S156C-S-j (See Figure 
8) calculated 27120, found 27120; S156C-S-k (See Figure 8) calculated 27408, found 
27411. 

Example 4 - Contents of Acetylated Glyco-CMM Libraries 

The levels of acetylation of glyco-CMMs after modification of SBL 
cysteine mutants with la, g-k (See Figure 8) at various pH levels were determined and are 
set forth in Tables 1 and 2, below. 



Table 1. Levels of Acetylation of Glyco-CMMs after Modification of SBL Cysteine 
Mutants with la at various pH a 



\Reagent 


la 


la 


la 


Enzyme\ 


pH 9.5 


pH 7.5 


pH5.5 


N62C 


4 




4 


S156C 


4 




4 


S166C 


4 




4 


L217C 


0 


2 b 


3 b 



Isolated as single species unless indicated. b Single product mass by ES-MS. 



Table 2. Levels of Acetylation of Glyco-CMMs after Modification of SBL Cysteine 
Mutants with la,g-k at pH 5.5 a 



\Reagent 
Enzyme\ 


la 


lg 


lh 


li 


lj 


lk 


N62C 


4 


4 


-ib 
S> 


4 


J 


7 


S156C 


4 


4 


4 


4 


4 


7 


S166C 


4 


-> 
J 


3 c ,4 d ,2 d 


3 b 


ob 


5 b 


L217C 


3 b 


2 b 


3 b 


3 b 


3 b 


6 b 



a Isolated as single species unless indicated. 
b Single product mass by ES-MS. 
c Major component. 
d Minor component. 



Example 5 - Incubation of L217C-S-(5-Gk(Ac) 3 at pH 9.5 

The general procedure for modification of SBL mutants stored as flash- 
frozen solutions was used to incubate 1 .26 mg of L217C-S-j3-Glc(Ac) 3 as a 0.5 mL 
aliquot in the absence of MTS reagent for 2 hours to give L217C-S-P-Glc as the sole 
product. MS m/z (ES-MS): L217C-S-P-Glc calculated 26882, found 26885. 

Example 6 — Active Site Titrations 

The active enzyme concentration was determined as previously described 
(Hsia et al., "Active-Site Titration of Serine Proteases Using a Fluoride-Ion Selective 
Electrode and Sulfonyl Fluoride Inhibitors," Anal. Biochem. , 242:221-227 (1996), which 
is hereby incorporated by reference) by monitoring fluoride release upon enzyme reaction 
with a-toluenesulfonyl fluoride (PMSF) as measured by a fluoride ion sensitive electrode 
(Orion Research 96-09). The active enzyme concentration determined in this way was 
used to calculate k, M values for each CMM. 
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Example 7 - Kinetic Measurements 

Michaelis-Menten constants were measured at 25(± 0.2)° C by curve 
fitting (GraFit® 3.03, Erithacus Software Ltd., Staines, Middlesex, UK) of the initial rate 
5 data determined at nine concentrations (0.125 mM-3.0 mM) of succinyl-AAPF-/?NA 
substrate in 0.1 M Tris-HCl buffer containing 0.005% Tween 80, 1% dimethylsufoxide 
("DMSCT), pH 8.6 (e 410 = 8800 M" 1 cm' 1 ) (Bonneau et al., J. Am. Chem. Soc , 1 19:1026- 
1030 (1991), which is hereby incorporated by reference). 

10 Example 8 - Controlled Site Selective Glycosylation of Proteins by a Combined Site- 
Directed Mutagenesis and Chemical Modification Approach 

Four SBL sites at different locations and of different characteristics were 
selected for mutation to cysteine in order to provide a broad test of the glycosylation 

15 methodology. S156 of the S, -pocket (Nomenclature of Schechter; Berger, Biochem. 

Biophvs. Res. Commun. , 27:157-162 (1967), which is hereby incorporated by reference) 
is a surface-exposed residue that permits the introduction of externally-disposed glycans 
mirroring those found naturally in glycoproteins ( Molecular Glycobiology, Fukuda et al., 
Eds. ? Oxford University, Oxford (1994), which is hereby incorporated by reference). In 

20 contrast, N62 in the S 2 pocket, SI 66 in the S, pocket, and L217 in the S/ pocket have side 
chains which are internally oriented and test the applicability of the method for 
introducing sugars at hindered locations. Broad applicability with respect to the sugar 
moiety was evaluated by using the representative series of protected and deprotected, 
mono- and disaccharide methanethiosulfonates ("MTS") la-k (see Figure 8). These were 

25 prepared from their parent carbohydrates in good to excellent yields (Figures 6 (Reagents 
and Conditions: (i) Ac 2 0, py then HBr, AcOH; (ii) NaSS0 2 CH 3 , EtOH, 90° C; 
(iii) Br(CH 2 ) 2 OH, BF 3 .Et 2 0 then Ac 2 0, py; (iv) NaSS0 2 CH 3 , DMF, 50° C; (v) NaOMe, 
MeOH; (vi) Ac 2 0, py then Br(CH 2 ) 2 OH ? BF 3 .Et 2 0, DCM) and 7 (Reagents and 
Conditions: (i) Ac 2 0, py. 92% for 5d, 99% for 5e; Ac 2 0, NaOAc, 82% for 5f; 

30 (ii) Br(CH 2 ) 2 OH, BF 3 .Et 2 0, DCM, 70% for 4i, 67% for 4j, 53% for 4k; (iii) NaOMe, 

MeOH, 96% for 4d, 92% for 4e, 90% for 4f; (iv) NaSS0 2 CH 3? DMF, 50° C, 80% for Id, 
71% for le, 68% for If, 88% for li, 83% for lj, 88% for lk)). Two types of 
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glycosylating reagents, the anomeric methanethiosulfonate la and the ethyl-tethered 
methanethiosulfonates lbx,g,h, were prepared from D-glucose (2a, Figure 6). The 
preparation of these reagents in fully protected la,g,h and deprotected lb,c forms allowed 
the effects of increased steric bulk and hydrophobicity to be assessed. Untethered MTS 
5 reagent la was readily prepared from acetobromoglucose (3) using NaSS0 2 CH 3 as shown 
in Figure 6 (Prepared from D-glucose according to Scheurer et al., J. Am. Chem. Soc , 
76:3224 (1954), which is hereby incorporated by reference). For the preparation of lb,g, 
an a-linked ethyl tether was introduced using Fischer glycosidation of D-glucose (2) with 
2-bromoethanol. Treatment of the tetraacetylbromide 4g with NaSS0 2 CH 3 allowed the 
10 preparation of the peracetylated a-gluco-MTS Ig in an excellent 90% yield. Zemplen 
deacylation (Zemplen et al., Ber. Dtsch. Chem. Ges. , 56:1705-1710 (1923), which is 
hereby incorporated by reference) of bromide 4g and subsequent displacement of bromide 
by methanethiosulfonate ion proceeded smoothly to yield the fully deprotected a-gluco- 
MTS lb in 69% yield. The (3-D-gluco-MTS reagents 1c and lh, which are epimeric at C- 
15 1 relative to lb and Ig, respectively, were prepared from the corresponding peracetylated 
(3-bromide 4h. The preparation of 4h took advantage of well-defined methodology 
utilizing Lewis acid catalyzed displacement of anomeric acetates by alcohols (Dahmen et 
al, "2-Bromoethyl Glycosides - Synthesis and Characterization," Carbohydr. Res. , 
1 16:303-307 (1983), which is hereby incorporated by reference). The protected bromide 
20 4h was elaborated to the corresponding peracetylated (lh) and deprotected (lc) p-gluco- 
MTS reagents in an essentially identical manner to that used for the epimeric a-gluco- 
MTS reagents. Thus, using NaSS0 2 CH 3 , 4h gave lh in 78% yield and, following 
deprotection, 4c (Helferich et al, Just. Lieb. Ann. Chem. , 541:1-16 (1939), which is 
hereby incorporated by reference) afforded lc in 68% yield from 4h. Parallel routes 
25 allowed similarly efficient access to the a-D-manno-MTS reagents Id and li, which are 
epimeric at C-2 relative to lb and lg, respectively, and the (3-D-galacto-MTS reagents le 
and lj, epimeric at C-4 relative to lc and lh, respectively (Figure 7). The ready 
adaptability of this method to oligosaccharides was illustrated by the preparation of the 
peracetylated (lk) and fully deprotected (If) disaccharide lacto-MTS reagents, in good 
30 overall yields from lactose (2f) of 38% and 27% respectively without cleavage of the 
interresidue bond. 
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Example 9 - Site Specific Glycosylation. 

The glyco-MTS reagents la-k (See Figure 8) were reacted with the chosen 
cysteine mutants SBL-N62C, -S156C, -S166C and -L217C in aqueous buffer under 
conditions described previously (Stabile et al., Bioorg. Med. Chem. Lett. , 6:2501-2512 
(1996); Berglund et al., J. Am. Chem. Soc , 1 19:5265-5266 (1997); DeSantis et al.. 
Biochem. , 37:5968-5973 (1998), which are hereby incorporated by reference). These 
reactions were rapid and quantitative, as judged by monitoring of changes in specific 
activity and by titration of free thiols with Ellman's reagent (Ellman et al., Biochem. 
Pharmacol , 7:88-95 (1961), which is hereby incorporated by reference). The 
glycosylated chemically modified mutants (CMMs) were purified by size-exclusion 
chromatography and dialysis, and their structures were confirmed by rigorous ES-MS 
analyses (± 7 Da) as shown in Table 3 below: 
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Table 3. 



Properties and Kinetic Parameters' 1 of Modified Enzymes 



Entry 


Reactant Pocket 


MTS 


Reacn. pH 


Product(s) 




K u (mM) 


KJKu 




Enzyme 


Reagent 










(s ,! mM-') 


1 


SBL-WT 


■ 


* 


■ 


1 53 ± 4 


0.73 + 0 05 


209 ± 1 5 


2 


N62C S, 


- 


■ 


- 


174 + 9 


1.90 + 0 20 


92 ± n 


3 




la 


9 5 


N62C-S~a b 


67 9 ± 3 5 


0.52 ±0 07 


130 6 ± 18.8 


4 




lb 


6 5 


N62C-S-b b 


135 3 ±3 5 


0.94 ± 0.05 


I43 9±8 5 


5 




1c 


6 5 


N62C-S-C* 


132 7±40 


1.25 ±0 08 


1 06 2 ± 7 5 


6 




Id 


65 


N62C-S-d b 


I32 9±3.i 


1.04 ±0 05 


127 8 ±6.8 


7 




le 


6 5 


N62C-S-e h 


1193 + 3 6 


0 99 ± 0 07 


120.5 ±93 


8 




If 


65 


N62C-S-I* 


129S±24 


L04±004 


124 8 ±5.3 


9 




lg 


5.5 


N62C-S-g t> 


120 0±2 7 


0.52 ± 0 03 


230 8 ± 14 3 


10 




Ih 


65 


N62C-S-Et-^-Glc(Ac) 2 c 


87.7±4 2 


L63±0 15 


53 8 ± 5 8 


11 




Ih 


5 5 


N62C-$-Et-£-Glc{Ac) 3 c 


1003 ±3 5 


l.86±0 12 


53 9 ± 4 0 


12 




li 


5.5 


N62C-S-i b 


123 0± 1 6 


1.05 ±0 03 


117.1 ±3.7 


13 




U 


5 5 


N62C-S-Et-(3-Gal(Ac), c 


103 4±4 3 


2.36 ± 0 17 


43 8 ± 3 6 


14 




Ik 


5 5 


N62C-S-k b 


64.9 ± 1 5 


0.88 ±0.05 


73.8 ±4 5 


15 


L217C S,' 


- 


- 


- 


41 ± 1 


0.80 ±0.04 


51+3 


16 




la 


95 


L2I7C-S-p-Glc b 


27.7±04 


0.79 ± 0 03 


35 1 ± i 4 


17 




la 


7.5 


L217C-S-P-Glc(Ac), c 


44 9 ± 2.0 


0.44 ±0 06 


102 0± 14.6 


18 




la 


5.5 


L217C-S-p-Glc(Ac), c 


36.3 ± 0.8 


0.36 ±0 03 


100 8± 8 7 


19 




lb 


65 


L217C-S-b b 


57.8 ±0.6 


0 67 ± 0 02 


86 3 ±2.7 


20 




1c 


65 


L217C-S-c b 


50.6±0 9 


0.67 ± 0 03 


75.5 ±3 6 


21 




Id 


6.5 


L217C-S-d b 


62.0 ± i 3 


0.55 ±0.03 


1 12.7 ±66 


22 




le 


65 


L217C-S-e b 


46.2 ±08 


0.63 ± 0 08 


73.3 ±3 7 


23 




If 


65 


L2l7C-S-f 


30.4±0 6 


0.46 ±0 03 


66 1 ±4.5 


24 




lg 


5 5 


L217C-S-Et-a-Glc(Ac), c 


72.7 ±3 1 


0.73 ±0 08 


99 6± 11.8 


25 




Ih 


5 5 


L217C-S-Et-P-Glc(Ac) 3 c 


29 4 ±0.8 


0.93 ± 0.06 


31.6±2.2 


26 




Ii 


5 5 


L217C-S-Et-a-Man(Ac)- c 


97 8 ±24 


0 59 ±0.04 


165 8 ± 12.0 


27 




lj 


5 5 


L217C-S-Et-p-Gal(Ac) 5 c 


39 2 ± 0 8 


1.17 ±0.05 


33.5 ± 1.6 


28 




Ik 


5 5 


L217C-S-Et-Lac(Ac) 6 c 


27 1 ±06 


0.69 ±0.04 


39 3 ±2.4 


29 


S156C S, 


- 


- 


- 


125 + 4 


0 85 + 006 


147 ± 11 


30 




la 


9 5 


S156C-S-a b 


54 8 ± 1 3 


0.70 ± 0.04 


78.3 ±4.8 


31 




lb 


6.5 


SI56C-S-b b 


77.0 ± 1.2 


0.84 ± 0 03 


91.7±3 6 


32 




lc 


6.5 


S156C-S-c b 


76.6 ± 1 7 


0.73 ± 0.04 


104.9 ±6.2 


33 




Id 


6.5 


S156C-S-d b 


88.6 ±2 8 


0.79±0.06 


112.2 ±9.2 


34 




le 


< < 




78.9 ± 1 9 


u.oy ± U.U4 


89.7 ±44 






i f 


f, ^ 
O J 




0-3.0 3: I .4 


\j.oy x u.uj 


71 


JO 




1 a 


5 D 








55 9 ± 3 0 


37 




1 h 




1 -J KJ\_s \J It 


64 0 ± 1 3 


n 77 -+• 0 04. 


88 9 ± 5.2 


38 




1 j 


5.5 


S 1 56C-S-i b 


60.3 ± 0.9 


0.7 1±0 03 


84.9 ± 3.8 


39 




li 


5 5 


Si56C-S-j b 


5 1 .9 ± 0 6 


0.61±0 02 


85 1 ±3.0 


40 




lk 


5 5 


Sl56C-S-k b 


53.6±0 8 


0.79±0.03 


67.4 ±2.8 


41 


S166C S, 








42+ 1 


0.50 + 0.05 


84 + 9 


42 




la 


95 


Sl66C-S-a b 


33.8 ± 1.3 


0.66 ± 0.06 


51.2±5.0 


43 




lb 


6.5 


SI66C-S-b b 


8I.9± 1.1 


1.14 ± 0 03 


71 8 ±2.1 


44 




lc 


65 


S166C-S-c b 


67.0 ±2.2 


0.99 ± 0.07 


67.6 ±5.3 


45 




Id 


65 


S166C-S-d b 


76.5 ±20 


1.17 ± 0.07 


65 4 ±4.3 


46 




le 


6 5 


S166C-S-e b 


62.2 ± 1 4 


1.08 ±0.05 


57 6 ±3 0 


47 




If 


6.5 


S166C-S-* 1 


58.2 ± 1 2 


1.02 ±0 04 


57.1±2.5 


48 




lg 


5 5 


S166C-S-Et-a-Glc(Ac) 3 c 


31 0±08 


0.77 ±0.05 


40.3 ± 2.8 


49 




lh 


6 5 


S166C-S-Et-£-Glc{Ac), c 


95.0 ±2 1 


0 87 ± 0 05 


109.2±6 7 










S166C-S-Et-(3-Glc(Ac) 3 d 








50 




Ih 


5 5 


S166C-S-Et-(K}lc(Ac) 7 c 


72.9 ± 1 7 


0 65 ± 0 04 


112.2 ±7.4 










S166C-S-h c 








51 




li 


5.5 


S166C-S-Et-a-Man(Ac) 3 c 


67.7 ± 1 9 


1.64 ±0.09 


41 3 ±2.5 


52 




lj 


5 5 


$166C-S-Et-£-Gal<Ac) 3 d 


65.1 ±0.9 


0.80 ± 0 03 


81.3 ± 3.3 


53 




lk 


5 5 


S166C-S-Et-Lac(Ac)5 d 


67 4 ± 1.6 


1.65 ±0.07 


40 8 ±2.0 


3 Michaelis-Menten constants were measured at 25 °C according to the initial rates method 


m 0.1 M Tris-HCl buffer at pH 8.6. 


0,005% Tween 80. 1% DMSO. suc-AAPF-^NA as the substrate 









b Single species. 
c Single product mass by ES-MS. 
d Major component. 
0 Minor component. 
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The CMMs each appeared as a single band on non-denaturing gradient PAGE, thereby 
establishing their high purities. The active enzyme concentration of the resulting CMM 
solutions was determined by active site titration with a-toluenesulfonyl fluoride (PMSF) 

5 using a fluoride ion-sensitive electrode (Hsia et al. ; "Active-Site Titration of Serine 
Proteases Using a Fluoride-Ion Selective Electrode and Sulfonyl Fluoride Inhibitors," 
Anal. Biochem. , 242:221-227 (1996), which is hereby incorporated by reference). In all 
cases, modification with the fully deprotected reagents lb-f (See Figure 8) led to site- 
specific glycosylations and the formation of single glycoforms. Furthermore, 

10 modification with the protected MTS reagents la, g-k (See Figure 8) gave products with 
controllable levels of acetylation. Through adjustment of pH and appropriate selection of 
the glycosylation site, differently acetylated glycoforms of SBL were prepared. This 
ability to modulate the level of acetylation through pH-control vastly expands the 
structural variety of glyco-CMMs that can be conveniently accessed and its scope was 

1 5 probed through the reaction of la (See Figure 8) with SBL-N62C, -S 1 56C, -S 1 66C, - 
L217C (Figure 9). 

The extent of deacetylation during modification was highly site-dependent. 
Modification of L217C with reagent la (See Figure 8) at pH 9.5 was accompanied by 
complete in situ deacetylation, and the sole product was the fully deprotected 

20 glucosylated-SBL, L217C-S-P-Glc. In contrast, treatment of N62C, S156C, and S166C 
with la (See Figure 8) at pH 9.5 yielded only fully acetylated products, N62C-S-a (See 
Figure 8), S156C-S-a (See Figure 8), and S166C-S-a (See Figure 8), respectively. To 
examine the effects of pH upon deacetylation, the reaction of L217C with la (See Figure 
8) was chosen. At pH 7.5 and 5.5, the products retained two and three acetate groups, 

25 forming L217C-S-p-Glc(Ac), and L217C-S-p-GIc(Ac) 3 , respectively. In all cases, 
complete integrity of the site selectivity was retained. 

This valuable site-dependent deacetylation was attributed to a novel 
intramolecular SBL-catalyzed process. Although acetate esters are moderately 
chemically labile in aqueous solution at pH 9.5, they are not at either pH 7.5 or 5.5 

30 (Greene et al., Protective Groups in Organic Synthesis , 2nd ed. New York, Wiley (1991), 
which is hereby incorporated by reference). The striking differences in behavior during 
modification between L217C and the three other mutants N62C. S156C, and S166C 



under identical reaction conditions discounted the possibility of deacetylation prior to 
modification. In addition, it was noted that position 217 bears an internally-oriented side 
chain and that modification of surface exposed position 156 showed no sign of 
deacetylation. This observation discounted both the possibility of either in situ chemical 
deacetylation or intermolecular enzymatic deacetylation. Furthermore, this ability of SBL 
to intramolecularly deacetylate was confirmed by the reaction of L217C-S-p-Glc(Ac) 3 at 
pH 9.5. Incubation of L217C-5-p-Glc(Ac) 3 under standard modification reaction 
conditions, but without reagent la (See Figure 8), gave L217C-S-p-Glc as the sole 
product (Figure 9). 

The enormous potential of this method was demonstrated by the 
preparation of a small library of differently acetylated glycosylated CMMs through the 
reaction of SBL-N62C, -S156C, -S166C, and -L217C with MTS reagents Ig-k (See 
Figure 8). Using the pH-activity profiles of wild-type ( t% WT") and CMMs of SBL as a 
guide, pH 5.5 and 6.5 were chosen to minimize deacetylation. Typically, the specific 
activity of SBL and its CMMs drops sharply below pH 7.5 to levels that at pH 5.5 are 5- 
20% those at optimal pH (8.5-9.5) (Desantis et ah, ^Chemical Modifications at a Single 
Site Can Induce Significant Shifts in the pH Profiles of a Serine Protease " J. Am. Chem. 
Soc , 120:8582-8586 (1998), which is hereby incorporated by reference). As expected, 
this drop in hydrolytic activity was reflected in the products of these modifications with 
lg-k (See Figure 8), which in all cases retained two or more acetate groups. 

For example, at pH 5.5 reactions of SBL-L217C and -S166C created 
singly deacetylated CMMs, with the exception of pure dideacetylated glyco-CMMs 
L217C-S-Et-a-Glc(Ac) 2 , and S166C-S-Lac(Ac) 5 . The formation of the latter may reflect 
the presence of two primary acetates in disaccharidic MTS reagent lk (See Figure 8). 
Primary acetate groups are typically more labile than secondary acetate groups under 
conditions of intermolecular enzymatic deacetylation (Bashir et aL, ''Enzymatic 
Esterification and De-Esterification of Carbohydrates - Synthesis of a Naturally- 
Occurring Rhamnopyranoside of P-Hydroxybenzaldehyde and a Systematic Investigation 
of Lipase-Catalyzed Acylation of Selected Arylpyranosides/' J. Chem. Soc. Per-kin - 
Trans. I , 2203-2222 (1995), which is hereby incorporated by reference). In contrast, 
reactions of SBL-S156C gave only the fully acetylated CMMs, S156C-S-g~k (See Figure 
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8). This uniform lack of deacetylation observed for surface exposed glycans at position 
156 was consistent with an intramolecular enzyme-catalyzed mechanism requiring 
internally -oriented acetate groups. Interestingly, the reactions of SBL-N62C were also 
determined by the anomeric configuration of Ig-k (See Figure 8) with a-MTS reagents 

5 lg,i (See Figure 8) giving products that retained all acetate groups while P-MTS reagents 
lh,j,k (See Figure 8) were monodeacetylated. The range of accessible acetylated glyco- 
CMMs was further extended through modification at pH 6.5. For example, at position 62, 
it allowed the introduction of diacetylated (3-glucose, forming N62C-S-Et-p-Glc(Ac) 2 , in 
place of the triacetylated N62C-S-Et-p-Glc(Ac) 3 formed at pH 5.5. The range of 

10 acetylation at position 1 66 was similarly expanded through the formation S 1 660S~Et-p~ 
Gic(Ac) 2 in place of S166C-S-Et-P-Glc(Ac) 3 . 

Example 10 - Glycan Structure-Hydrolytic Activity Relationships 

15 The effects of glycosylation upon SBL were assessed by the determination 

of k cat and K M for the hydrolysis of succinyl-AAPF-p-nitroanilide (Suc-AAPF-/?NA) at 
pH 8.6. The kinetic parameters of the 48 CMMs generated were compared with those of 
WT and unmodified mutants in Table 3. The excellently selective and controlled method 
shown in Figure 8 allowed the introduction of structurally related monosaccharides, D- 

20 glucose, D-galactose, and D-mannose, in addition to the more stericaily bulky 

disaccharide lactose. From the resulting glycosylated CMMs, a detailed and precise set of 
structure-activity relationships was generated (Figures 10-12). 

At position 62, in the S 2 pocket, the 2.3-fold reduction in k c JK Ki caused by 
mutation to cysteine was partially restored by glycosylation (Figure 10A). The 

25 introduction of ethyl-tethered a- or p-glucose, p-galactose or a-mannose to N62C 

increased k c JK M and formed N62C-S-b-e (See Figure 8) with k c JK M s 1.5- to 2-fold lower 
than WT. Despite its steric bulk and high hydrophilicity, disaccharidic lacto-CMM 
N62C-S-f (See Figure 8) also showed higher activity than N62C with kJK tX! only 1.7-fold 
lower than WT. 

30 The effects of the mutation of position 217 in the S, f pocket were 

intrinsically more dramatic as indicated by a value of k c JK M iox L217C that is 4-foid 
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lower than WT (Figure 10B). The introduction of deprotected untethered glucose, 
forming L217C-S-P-Glc, lowered k c JK M further to 6-fold lower than WT. In contrast, 
glycosylation of position 217 with ethyl tethered MTS reagents lb-f (See Figure 8) 
restored activity and k c JK u s for L217C-S-b-f (See Figure 8) were similar to each other in 
the range 2.5- to 3.1 -fold lower than WT. This striking difference between tethered 
L217C-S-b-f (See Figure 8) and untethered L217C-S-P~Glc illustrated that SBL tolerates 
the replacement of hydrophobic Leu with highly hydrophilic carbohydrate moieties when 
they are linked by a hydrophobic ethyl spacer group better than directly-linked Cys-S-(3- 
Glc. This may indicate that a structural requirement for efficient amidase activity is a 
closely-bound hydrophobic residue in the S/ subsite of SBL and contrasts sharply with 
the excellent enhancement of esterase activity caused by the same Cys-S~j3-Glc 
substitution. 

Mutation of position 156 in the S, pocket to cysteine caused a 1.4-fold 
drop in k c JK M (Figure IOC). Subsequent introduction of deprotected S-Et-a-Glc, side 
chain b (See Figure 8), resulted in a k c JK M for S156C-S-b (See Figure 8) that was 2.3- 
fold lower than WT. From S156C-S-b to -f (See Figure 8), k c JK^ varied in an arced 
mariner peaking at 1 .9-fold lower than WT for S 1 56C-S-d (See Figure 8) and then 
decreasing monotonically to a k c JK M for S156C-S-f (See Figure 8) that was 3-fold lower 
than WT. The similar K sl values for these SI 56 CMMs to those of SBL-WT were 
indicative of these modifications having little effect upon ground state binding and were 
consistent with the surface exposed orientation of the SI 56 side chain. 

At position 166, in the S, pocket, the 2.5-fold decrease in k c JK M caused by 
mutation to cysteine was amplified by modification with lb (See Figure 8) and led to a 
kJK KJ value 3-fold lower than WT for S166C-S-b (See Figures 8 and 10D). From 
S166C-S-b to -f (See Figure 8), k c JK M decreased monotonically to a k c JK M for S166C-S- 
f (See Figure 8), in which the S 1 binding site was occupied by the sterically bulky 
disaccharide lactose, that was 3.8-fold lower than WT. 
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Example 11 - Kinetic Effects of Glycosylation with Acetylated Carbohydrates. 

The enormous potential of the controlled site-selective glycosylation 
approach depicted in Figure 8 was illustrated by the great variety of changes in k c JK u thai 
were caused by the introduction of acetylated side chains a,g-k (See Figure 8) to SBL. 
These dramatic changes contrast with the slight variations found for deprotected side 
chains b-f (See Figure 8). For example, at position 62, an alternating decrease-increase 
pattern was observed (Figure 1 1 A). This resulted in a k c JK KI for tetraacetylated a-gluco- 
CMM N62C-S-g (See Figure 8) that was 1 . 1 -fold higher than WT. Similar alternating 
patterns were also seen at positions 217 (Figure 1 IB) and 166 (Figure 1 ID). At position 
156, variations were slight, which was consistent with its surface exposed orientation 
(Figure 11 C). 

To examine the cause of these variations, the k c JK iU s of acetylated 
glycosylated CMMs were compared with those for deprotected glycosylated CMMs with 
the same glycan structure and stereochemistry (Figure 12). This separated the effects of 
acetylation from the effects of glycosylation and allowed the underlying effects of 
modification to be dissected. It was clear from Figure 12 that the anomeric 
stereochemistry of the acetylated glycans modulates k c JK M . 

For example, at position 62 (Figure 12 A) comparison of N62C-S-b,c (See 
Figure 8) with N62C-S-g (See Figure 8) and N62C-S-Et-f}-Glc(Ac) 2J showed that 
increasing the number of acetate groups from zero to four, from N62C-S-b (See Figure 8) 
to N62C-S-d (See Figure 8), increased k c JK M 1.6-fold for the oc-gluco side-chain b (See 
Figure 8). In contrast, increasing the number from zero to two or three, from N62C-S-C to 
N62C-S-Et-3-Glc(Ac) 2or3 , was detrimental for the p-gluco side-chain c (See Figure 8) and 
led to a 2-fold decrease. Similarly, N62C-S-Et~P-Gal(Ac) 3 displayed a distinctly lower 
kJK u than N62C-S-e (See Figure 8) that was 5-fold lower than WT. These changes in 
k u JK S} upon acetylation were manifested largely through increased or decreased ground 
state binding, of which the most striking example was a K Af for N62C-S-Et-p-Gal(Ac) 3 
that was 2.4-fold higher than deprotected galacto-CMM N62C~S-e (See Figure 8). 

At position 217, control of the level of acetylation through pH, as shown in 
Figure 9, had allowed the introduction at position 217 of untethered (3-D-glucose bearing 
zero, two, and three acetate groups. As Figure 12B illustrates, the addition of two or three 
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acetate groups restored kJK u from 6-fold lower than WT for L217C-S-p-Glc to 2-fold 
lower than WT for L217C-S-p-Glc(Ac) 2 or L217C-S-p-Glc(Ac)>. This showed that 
acetylation allowed fine-tuning of activity and paralleled increases in the esterase k L JK hf s 
of these CMMs. 

The same trend in k c JK u was observed for the L217C ethyl tethered 
CMMs as at position 62: acetylation was beneficial to a-tethered but detrimental to p- 
tethered CMMs. For example, increasing the number of acetate groups in the a-linked 
glucose moiety from zero to two, i.e., from L217C-S-b (See Figure 8) to L217C-S-Et-a- 
Glc(Ac) 2 , increased k c JK u to 2-fold lower than WT. In contrast, increasing the number of 
acetate groups in the epimeric p-linked moiety from zero to three, i.e., from L217C-S-C 
(See Figure 8) to L217C-S-Et-p-Gle(Ac) 3 , halved kJK Ki to 6-fold lower than WT. 
Similarly, the k c JK M of a-linked L217C-S-Et-a-Man(Ac) 3 was 1 .5-fold higher than the 
corresponding deprotected L217C-S-d (See Figure 8), while p-linked L217C-S-Et-P- 
Gal(Ac) 3 was 2-fold lower than the corresponding deprotected L217C-S-e (See Figure 8). 

Consistent with its surface exposed orientation, the changes at position 156 
caused by acetylation were slight and no variation with anomeric stereochemistry was 
seen (Figure 12C). Increasing the number of acetate groups from zero to four, from 
S156C-S-b-e (See Figure 8) to S156C-S-g-j (See Figure 8), decreased kJK M hy 1.05- to 
1 .6-fold. The most sterically bulky lactose side chain (-k) (See Figure 8) gave rise to the 
lowest k Ll JK Si at position 156, 3.2-fold lower than WT, and indicated that even at the 
surface of SBL the introduction of sterically bulky groups still allowed tailoring of 
k c JK M . 

At position 166, the effects of increased acetylation were, as at positions 
62 and 217, modulated by glycan anomeric configuration (Figure 12D). However, the 
direction of these increases and decreases was reversed: acetylation was beneficial to P- 
tethered but detrimental to a-tethered CMMs. For example, the a-tethered S166C-S-Et- 
a-Glc(Ac) 3 had a 1.8-fold lower k c JK M value than the corresponding deprotected S166C- 
S-b (See Figure 8), while p-linked CMMs S166-S-Et-p-Glc(Ac) 2J had 1 .6-fold higher 
KJK M values than the corresponding fully deprotected S166-S-C (See Figure 8); Again, 
these variations were largely manifested through changes in ground state binding. For 
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example, AT V/ increased 1.4-fold from S166C-S-Et-a-Man (-d) (See Figure 8) to S166C-S- 
Et-a-Man(Ac) 3 . 

It should be noted that the changes in activity of the lacto-CMMs upon 
acetylation fell largely outside of these trends and at all four positions acetylation of the 
bulky, disaccharidic side-chain f (See Figure 8) caused a general decrease in k L JK u s. For 
example, at position 62 heptaacetylation, from N62C-S-f (See Figure 8) to N62C-S-k 
(See Figure 8), resulted in a lowering of k c JK u to a value that is 3-fold lower than WT. 
Despite the greater steric bulk of side chain k (See Figure 8), this drop was a consequence 
of a lower k can 2-fold lower than N62C-f (See Figure 8), rather than a higher K u . In fact, 
the K Xi value of N62C-S-R (See Figure 8) was 1 .3-fold lower than N62C-5-f (See Figure 
8). Similarly, L217C-S-Et-Lac(Ac) 6 and S166C-S-Et-Lac(Ac) 5 had 1.7-fold and 1.4-fold 
lower k c JK A js than the corresponding deprotected CMMs, respectively. 

In summary, the strategy of site-directed mutagenesis combined with 
chemical modification was exploited for the site-selective glycosylation of SBL. This 
method was general, versatile and allowed the preparation of pure glycoforms which 
constitute the first examples of regio- and glycan- specific protein glycosylation at 
predetermined sites. Careful control of a novel SBL-catalyzed intramolecular 
deacetylation greatly expanded the scope of this method and through reaction of SBL- 
N62C, -S156C, -S166C, and -L217C with peracetylated MTS reagents la,g-k (See Figure 
8) allowed the introduction of glycans with precisely modulated levels of acetylation. 

The glycosylated CMMs formed display k c JK M values that ranged from 
1 . 1 -fold higher than WT to 7-fold lower than WT. Without the use of this highly 
selective glycosylation technique, the determination of such precise trends would be 
unachievable and variations caused by previous non-specific glycosylation could only be 
interpreted in a general manner. It has been demonstrated that subtle differences in 
carbohydrate structure may be used to fine tune the activity of SBL. For example, the 
anomeric stereochemistry of the glycans introduced modulated changes in k c JK A/ upon 
acetylation. At positions 62 and 217, acetylation enhanced the activity of a-tethered 
CMMs but decreased that of (3-tethered. This trend was reversed at position 166 where, in 
contrast, acetylation enhanced the k ca /K M s of p-tethered CMMs but decreased those of oc- 
tethered. Consistent with its surface exposed nature, changes at position 156 were more 
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modest, but still allowed control of activity particularly through glycosylation with 
disaccharide lactose. These results illustrated the great potential for tailoring activity 
through the correct choice of glycan and glycosylation site. 

The ability of the glycosylation method to glycosylate the binding pockets 

5 of SBL also creates opportunities to broaden its substrate specificity. For instance, an 
array of hydrogen bonding hydroxyl groups may broaden its specificity towards hydrogen 
bonding substrates such as glycosylated amino acids. Subtilisins have been elegantly 
used to catalyze the synthesis of glycopeptides (Witte et ah, "Solution- and Solid-Phase 
Synthesis of N-protected Glycopeptide Esters of the Benzyl Type as Substrates for 

10 Subtilisin-Catalyzed Glycopeptide Couplings;' J. Am. Chem. Soc. 120:1979-1989 

(1998); Wong et al, "Enzymatic-Synthesis of N-Linked and O-Linked Glycopeptides," J. 
Am. Chem. Soc. 1 15:5893-5901 (1993), which are hereby incorporated by reference). 
However, the natural specificity of these enzymes has limited these peptide ligations to 
those in which the glycosylated residues are at least one residue distant (P 2 ,P 3 . . . or 

15 P 2 ' JY * . -) from the amide bond formed. For example, while ligation of Z-Gly-OBz with 
H-Gly-Ser(Ac 3 GlcNAcP)-NH 2 was successful, no yield of product was obtained with H- 
Ser(Ac 3 GlcNAc[3)-NH 2 (Witte et al., J. Am. Chem. Soc . 120:1979-1989 (1998), which is 
hereby incorporated by reference). The introduction of sugars to the Sj and S,' subsites as 
hydrogen bonding groups demonstrated here may enhance the specificity of proteases 

20 towards hydrophilic substrates. 

Furthermore, by choosing carbohydrate attachments that differ from each 
other at only one stereocenter, SARs may be determined by examining changes in activity 
as the nature of sugar side chain is varied. For example, the effect of inverting 
stereocenters in the order C-4-»C-l-»C-2 can be determined using CMMs in the series e 

25 — > c-> b— > d (See Figure 8). While the current illustrations have been with SBL as a 

protein example, the method is clearly amenable to the glycosylation of any protein and is 
without limitation with respect to the sites and to the glycans that may be conjugated. It 
will, therefore, allow the introduction of any therapeutically important carbohydrate 
recognition determinant, of which the (3-D-galactopyranosyl moiety of e and f (See Figure 

30 8) that represents a ligand of the hepatic asialoglycoprotein receptor (Sharon et ah, Essays 
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Biochemu 30:59-75 (1995), which is hereby incorporated by reference) is just one 
example. 

Example 12 - Esterase Screen 

Specificity constants determined using the low substrate approximation 
were measured indirectly using Ellman's reagent (Ellman et ah, Biochem. Pharmacol. , 
7:88-95 (1961), which is hereby incorporated by reference) (s 4l2 = 13600 M" 'cm' 1 ) using 
0.15 and 0.30 mM succinyl-AAPF-SBn as substrate in 0.1 M Tris.HCl, containing 0.005 
vol% DMSO, 1 voI% 37.5 mM Ellman's reagent in DMSO, pH 8.6. 

Example 13 - Full Esterase Kinetics Measurements 

Michaelis-Menten constants were measured at 25 °C by curve fitting 
(Grafit® 3.03, Erithacus Software Ltd., Staines, Middlesex, UK) of the initial rate data 
determined at eight concentrations (31.25 |aM - 3 mM) of the succinyl-AAPF-SBn 
substrate, followed indirectly using Ellman's reagent in 0.1 M Tris.HCl, containing 0.005 
vol% DMSO, 1 vol% 37.5 mM Ellman's reagent in DMSO, pH 8.6. 

Example 14 - Esterase Activity Screen 

The glyco-CMMs shown in Table 4 were prepared, by reacting reagents 
la-k (See Figure 8) with the chosen cysteine mutants SBL-N62C, -S156C, -S166C and - 
L217C, purified and extensively characterized as described previously. 



Table 4. 
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a Kinetic constants determined in duplicate using the low substrate concentration approximation in 0.1 M Tris buffer. 
pH 8.6, 0.005% Tween 80. 1% DMSO with suc-AAPF-SBn as substrate. [S] =*= 15 or 30 u.M [E] = 4.8 x I0' n to 6.0 
\ 10- l(1 M. 

b E/A = (Ko/KuXs^nsc I (£ t a/^l/)atm«iaw 

c For SBL-WT: kJK M = 3592.5 mM-'s"', E/A - 17. 



The kinetic parameters of esterase activity were determined at pH 8.6 by indirectly . 
following the release of thiobenzyl alcohol from the substrate succinyl-Ala-Ala-Pro-Phe- 
SBn (suc-AAPF-SBn) with Ellman's reagent (Ellman et al. ? Biochem. Pharmacol. , 7:88- 
95 (1961), which is hereby incorporated by reference). To allow a rapid screen of 
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esterase activity, a low substrate concentration ([S] « K t J) was used that allowed k c JK u 
to be determined directly from the initial rate of reaction. The results from the screen are 
shown in Table 4. 

Modification at position 62, in the S 2 pocket, with deprotected sugar 
5 reagents lb-f (See Figure 8) increased k c JK hh resulting in a series of five enzymes that 
had similar k L JK iU s that were 1.3- to 1.9-fold greater than WT (Figure 13 A). The 
presence of an cc-linkage was clearly deleterious to activity, as N62C-SEt(3Glc (-c) (See 
Jf Figure 8) had a kJK M 1 .2-fold greater than its epimer N62C-SEtctGlc (-b) (See Figure 8) 

O and 1 .9-fold greater than WT. Furthermore, the oc-linked N62C-SEtaMan (-d) (See 

fy 10 Figure 8) had the lowest k c JK M in this group which was 1.3-foid greater than WT. 
;f1 As at position 62, the introduction of any of the deprotected sugar side 

l ^ chains b-f (See Figure 8) at position 217, in the S,' pocket increased k c JK M (Figure 13B). 

Q However, the effects of glycosylation at this site were far more dramatic as demonstrated 

□ by a k c JK u for L217C-SEt(3Gal (-e) (See Figure 8) that was 3.4-fold greater than WT. By 

15 comparing the kJK^ of L217C-S(3Glc and L217C-SEt(3GIc (-c) (See Figure 8), it was 
fU possible to gauge the effect on activity of introducing an ethyl tether at this position. This 

introduction increased k c JK M , from 1. 8-fold greater than WT for L217C-S(3Glc to 2.7- 
fold greater than WT for L217C-SEt(3Glc (-c) (See Figure 8). As at position 62, p-linked 
glyco-CMMs (-c,-e,-f) (See Figure 8) had higher k c JK^ than the a-linked ones (-a,-d) 
20 (See Figure 8). For example, L217C-SEtpGlc (-c) (See Figure 8) had a k c JK M 1.3-fold 
greater than L217C-SEtccGlc (-b) (See Figure 8). - 

Consistent with the surface exposed orientation of the SI 56 side chain, the 
S156C deprotected glyco-CMMs had similar k ca /K iV/ s that were 1.3- to 2.1-fold lower than 
WT (Figure 13C). 

25 At position 166, in the S, pocket, mutation to cysteine resulted in an 

enzyme with a severely lowered k u JK M that was 1 0-fold lower than WT. However, 
subsequent modification with lb-f (See Figure 8) restored much of the catalytic activity 
(Figure 13D), and the S166C deprotected glyco-CMMs S166C-S-b-f (See Figure 8) had 
similar k c JK M $>, that varied from 1 . 1 - to 1 .4-fold lower than WT. 
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Example 15 - Kinetic Effects of Glycosylation with Acetylated Carbohydrates. 

At position 62, in the S 2 pocket, in sharp contrast to the trend observed for 
the unacetyiated N62C CMMs, acetylated N62C CMMs had a wide range of k c JK SI s. 
Introduction of acetates increased the k c JK A jS of the a-linked CMMs relative to the 

5 corresponding deprotected glyco-CMMs (Figure 14A). Thus. N62C-SEtctGlc(Ac) 4 (-b) 
(See Figure 8) and N62C-SEtaMan(Ac) 4 (-d) (See Figure 8) had kJK u s 1.9- and 1.6-fold 
greater than WT, respectively. Acetylation was clearly deleterious for (3-linked CMMs, as 
N62C-SEtpGai(Ac) 3? N62C-SEtpGlc(Ac) 2 , and N62C-SEtpGlc(Ac) 3 all had kJK,j$ 
lower than WT. However, increasing the number of acetates present on the CMM 

10 restored activity: N62C-SEtj3Glc(Ac) 3 had a kJK M only 1 .3-fold lower than WT and 1 .5- 
fold higher than N62C-SEtpGlc(Ac) 2 . In spite of their size, the sterically bulky side chain 
lactosylated N62C CMMs, N62C-SEtLac (-f) (See Figure 8) and N62C-SEtLac(Ac) 7 (-k) 
(See Figure 8) had k c JK^ that were similar to those of the CMMs derived from 
monosaccharides. This provided a clear example of the versatility of the glycosylation 

1 5 method illustrated in Figure 8 and demonstrated that by using this method it was possible 
to introduce very large structures into the active site of SBL while maintaining catalytic 
competency. 

Modification with acetylated reagents la,g-k (See Figure 8) at position 
217, in the S, 1 pocket, as with lb-k (See Figure 8), led to CMMs with greater than WT 

20 k ca /K Xf s (Figure 14B), For the untethered glyco-CMMs, increasing the number of acetates 
dramatically increased k c JK M , from L8-fold greater than WT for L217C-SpGlc to 2.4- 
fold greater than WT for L217C-SpGlc(Ac) 2 and to 3.2-fold greater than WT for L217C- 
SpGlc(Ac) 3 and mirrored the trend seen in amidase kinetics. For the ethyl linked L217C 
glyco-CMMs, the effect of acetylation was dependent on the anomeric stereochemistry, as 

25 observed for N62C glyco-CMMs and the L217C glyco-CMMs amidase kinetics. 
Acetylation of a-linked CMMs increased k c JK u but decreased k c JK u for p-linked 
CMMs. This was most pronounced for L217C-SEtpGal(Ac) 3 which had a kJK KI only 
1.1 -fold greater than WT and 3.1 -fold lower than L217C-SEtpGal (-e). In contrast to the 
effect on deprotected L217C glyco-CMMs, the activity of acetylated L217C glyco-CMMs 

30 decreased upon the introduction of the ethyl linker. For example, L217C-SpGlc(Ac) 3 had 
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a k c JK M 3-2-fold greater than WT, as compared with the 1.4-fold greater than WT k c JK hf 
ofL217C-SEtpGlc(Ac) 3 . 

At position 156, in the 83 pocket, the S156C acetyiated glyco-CMMs 
displayed little difference in their kinetic constants from their unacetylated counterparts 
(Figure 14C), an observation that was consistent with the surface exposed nature of the 
position 1 56 side chain. Introducing the ethyl linker led to an increase in k c JK lU from 2.6- 
fold lower than WT for S156C-S(3Glc(Ac) 4 (-a) (See Figure 8) to 1.5-fold lower than WT 
for S156C-SEt(3Glc(Ac) 4 (-h) (See Figure 8). 

In general, at position 166, in the S { pocket, the effect of acetylation on 
S166C ethyl linked glyco-CMMs was to reduce k c JK^ relative to their deprotected 
counterparts (Figure 14D). The exceptions were the ethyl linked [3-gluco-CMMs, S166C- 
SEtpGlc(Ac) 2 and S166C-SEtpGlc(Ac) 3 , which displayed kJK Af s 1.4- and 1.7- fold 
greater than WT, respectively. These were the only two glyco-CMMs prepared at this site 
to show an enhancement in k c JK, u relative to WT, and this example illustrates that the 
correct selection of sugar is crucial to the tailoring of enzyme activity. In contrast to the 
effects observed at positions 62 and 217, an a-linkage to the sugar moiety was deleterious 
to the activity of the acetyiated CMMs and S166C-SEtaGlc(Ac) 3 had a kJK M 1.9-fold 
lower than WT, in direct contrast to S166C-SEt(3Glc(Ac) 3 . Introduction of the sterically 
bulky lactose moiety, in both acetyiated and unacetylated forms, led to CMMs S166C- 
SEtLac (-f) (See Figure 8) and S166C-SEtLac(Ac) 5 with low kJK M s that were 1.4- and 
2. 3 -fold lower than WT, respectively. 

Example 16 - Full Esterase Kinetics. 

The three esterases with the highest k ca /K tXI s determined by the above 
screen, L217C-SpGlc(Ac) 3? L217C-SEtaMan(Ac) 3 , and L217C-SEtpGal (-c) (See Figure 
8) had their individual k ca( s and K Xf s determined by the initial rates method. The results 
are shown in Table 5. 
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Table 5. 

Full Esterase Kinetic Data for Glycosylated CMMs a 



Enzyme 


^(s 1 ) 


£ A ,(mM) 




xWT 


E/A b 








(mM-'s 1 ) 






WT 


1940.0±180 


0.54±0.07 


3592.5±572.7 


1 


17 


L217C-SpGlc(Ac) 3 


4427.5±100.9 


0.15±0.01 


29516.7±2079.0 


8.4 


293 


L217C-SEtaMan(Ac) 3 


3827.0±59.5 


0.30±0.01 


12756.7±469.2 


3.6 


77 


L217C-SetpGal 


4398.5±189.8 


0.36±0.04 


12218. U1456.3 


3.5 


167 



3 Kinetic constants determined by method of initial rates in 0.1 M Tris buffer, pH 8.6, 
0.005% Tween 80, 1% DMSO with suc-AAPF-SBn as substrate. [S] = 30 jaM to 2 mM, 
[E]= 9.6 x 10" u to 1.1 x 10~ 10 M. 

E/A = (k c J ^v/) esterase / (^ c «/^V/)ai 



/amidase* 



The results were in good agreement with those determined by the screen 
for L217C-SEtctMan(Ac) 3 and L217C-SEt(3Gal (-e) (See Figure 8) and confirmed the 

10 activity of these two enzymes to be 3.6- and 3.5- fold higher than WT, respectively. 

These increases in activity arose from both increased transition state stabilization, with 
k ca/ s 2- and 2.3-fold greater than WT, respectively, and from greater substrate binding, 
with K^jS 1.8- and 1.5-fold lower than WT, respectively. 

Remarkable results were obtained for L217C-S(3Glc(Ac) 3 . This enzyme 

15 had a k ccu 2.3-fold greater than WT and a K M 3.6-fold lower than WT, giving a k c JK M 8.4- 
fold greater than WT and some 2.5-fold greater than the value estimated by the screen. 
The difference in parameters obtained from the screen and the full kinetic analysis 
exposed a limitation of the low substrate screen. For the low substrate approximation to 
be accurate, the substrate concentration must be small compared to K u . The K lU of 

20 L217C-SpGlc(Ac) 3 (0.15 mM) was evidently so small that the approximation did not hold 
in this case. This was the largest enhancement of activity relative to WT achieved using 
the combined site-directed mutagenesis and chemical modification strategy. 



Example 17 - Esterase Activity versus Amidase Activity. 

25 The differing effects of glycosylation upon amidase and esterase k c JK lU 

can be compared in an informative manner using the (k c JK u ) estevase I {kJK^ M ^ ratio, 
E/A (see Figure 15). 
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All N62C deprotected glyco-CMMs had E/As that were enhanced relative 
to WT. The increase in E/A was dependent on the presence of either an a- or a ^-linkage 
with the p-linked CMMs N62C-SEtpGlc (-c) (See Figure 8), ~SEt(3Gal (-e) (See Figure 
8), -SEtLac (-f) (See Figure 8) having higher E/As than the a-linked CMMs N62C- 
SEtaGlc (-b) (See Figure 8) and -SEtctMan (-d) (See Figure 8) (Figure 15 A). These 
increased ratios were due to both increases in esterase k c JK A p and reductions in amidase 

As observed for the modifications made at position 62, glycosylation at 
position 217, in the S,' pocket, led to enzymes that had greatly increased E/A ratios 
relative to WT. Mutation to cysteine at position 217 increased E/A to 6.4-fold greater 
than WT. Modification with unacetylated p-linked sugars -SpGlc, -SEtpGlc (-c) (See 
Figure 8), -SEtpGal (-e) (See Figure 8), -SEtLac (-f) (See Figure 8) further increased E/A 
(Figure 15B). In contrast, a-linked glyco-CMMs had lower E/A values than that of the 
mutant. These changes in E/As were a result of parallel changes in esterase k c JK A jS that 
were further amplified by opposing changes in amidase k c JK A fi. Introduction of an ethyl 
linker reduced the E/A from 10.9-fold greater than WT for L217C-SpGlc to 7.6-fold 
greater than WT for L217C-SEtpGlc (-c) (See Figure 8). 

At position 156, in the S, pocket, E/A ratios for unacetylated glyco-CMMs 
were all similar to each other in the range 1.2- to 1 .6-fold greater than WT (Figure 15C). 

The mutant S166C had an exceptionally low E/A that was 4.2-fold lower 
than WT (Figure 15D) and this was largely a result of its very low esterase k ca JK u . 
Because modification of SI 66C restored esterase k c JK^ to levels approaching that of 
WT and because SI 66C glyco-CMMs were poor amidases relative to WT, the net result 
was a family of CMMs with similar E/A ratios that were all enhanced relative to WT and 
significantly higher than the cysteine mutant (Figure 1 5D). 

Example 18 - Effects of Glycosylation with Acetylated Carbohydrates on E/A. 

At position 62, in the S 2 pocket, with the exception of N62C-SEtpGal(Ac) 3 
and N62C-SEtaMan(Ac) 4 (-i) (See Figure 8), acetylation of N62C glyco-CMMs led to a 
reduction in E/A. However, like their deprotected counterparts, the acetylated N62C 
glyco-CMMs all had larger E/As than WT (Figure 16A). Increasing the level of 
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acetylation increased the E/A ratio from 2. 1 -fold greater than WT for N62C- 
SEtpGlc(Ac), to 3.0-fold greater than WT for N62C-SEt(3Glc(Ac) 3 . In spite of the 
general increases in E/A observed for the ethyl linked glyco-CMMs, the untethered CMM 
N62C-SPG1c(Ac) 4 (-a) (See Figure 8) had an E/A very similar to WT. 

5 At position 217, in the S t ' pocket, L217C acetylated glyco CMMs all had 

enhanced E/As relative to WT (Figure 16B). As for their deprotected counterparts, (3- 
linked acetylated glyco-CMMs had higher E/As than that of the L217C mutant, whereas 
the E/As of the a-linked acetylated glyco-CMMs were lower than that of the mutant. In 
fact, modification at this site produced CMMs with by far the greatest enhancement of 

10 this ratio and the E/A for L217C-SpGlc(Ac) 3 was 17.2-fold greater than WT. In contrast 
to the L217C deprotected glyco-CMMs, introducing an ethyl linker lowered E/A from 
17.2-fold greater than WT for L217C-SpGlc(Ac) 3 to 9.2-fold greater than WT for L217C- 
SEtpGlc(Ac) 3 . For the untethered L217C acetylated glyco-CMMs, increasing the number 
of acetates also increased E/A, from 5.0-fold greater than WT for L217C-SEtpGlc(Ac) 2 to 

15 17.2-fold greater than WT for L217C-SEt(3Glc(Ac) 3 . 

At position 156, in the S } pocket, acetylation of a-linked glyco CMMs 
increased E/As (Figure 16C). Hence, S156C-SEtpGlc(Ac) 4 (-h) (See Figure 8) had an 
E/A that was 1.5 fold greater than WT whereas S156C-SEtaGlc(Ac) 4 (-g) (See Figure 8) 
had an E/A 2.0-fold greater than WT. S156C-SEtocMan(Ac) 4 (-i) (See Figure 8) also had 

20 an E/A 1 .3-fold greater than that of its unacetylated counterpart, SI 56C-SEtaMan (-d) 
(See Figure 8). 

At position 166, in the S t pocket, acetylation caused an increase in E/A for 
the glucosylated CMMs, irrespective of the anomeric stereochemistry (Figure 16D). 
Acetylation of all other sugar moieties led to a reduction in E/A. Increasing the number 
25 of acetates increased E/A from 2.7-fold greater than WT for S166C-SEtpGlc(Ac) 2 to 3.2- 
fold greater than WT for S166C-SEtpGlc(Ac) 3 . 



Example 19 - Molecular Modeling 
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The X-ray structure of subtilisin Bacillus lentas with the peptide inhibitor 
AAPF bound (Brookhaven database entry 1 JEA) was used as the starting point for 
calculations on wild type and CMMs. The enzyme setup was performed with Insight II, 
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version 2.3 0 (Biosym Technologies, Inc. San Diego, CA). To create initial coordinates 
for the minimization, hydrogens were added at the pH 8.6 used for kinetic measurements. 
This protonated all Lys and Arg residues and the N-terminus and deprotonated all Glu and 
Asp residues and the C-terminal carboxyl group. The protonated form of His 64 was used 
in all calculations. The model system was solvated with a 5 A layer of water molecules. 
The total number of water molecules in the system was 1 143. The overall charge of the 
enzyme-inhibitor complex resulting from this setup was +4 for the WT enzyme. Energy 
simulations were performed with the DISCOVER program, Version 2.9.5 (Biosym 
Technologies, Inc., San Diego, CA) on a Silicon Graphics Indigo computer, using the 
consistent valence force field (CVFF) function. A non-bonded cutoff distance of 18 A 
with a switching distance of 2 A was employed. The non-bonded pair list was updated 
every 20 cycles and a dielectric constant of 1 was used in all calculations. The WT 
enzyme was minimized in stages, with initially only the water molecules being allowed to 
move, followed by water molecules and the amino acid side chains, and then finally the 
entire enzyme. The mutated and chemically modified enzymes were generated by 
modifying the relevant amino acid using the Builder module of Insight. These structures 
were then minimized in a similar manner. Initially, the side-chain of the mutated residue 
and the water molecules were minimized. Then, all side-chains and the water molecules 
were minimized while the backbones of the residues were constrained, then all of the 
atoms were minimized. The AAPF inhibitor was free to move throughout all stages of 
the minimization. Each stage of energy minimization was conducted by means of the 
method of steepest descents without Morse or cross terms until the derivative of energy 
with respect to structural perturbation was less than 5.0 kcal/A; then, the method of 
conjugate gradients, without Morse or cross terms until the derivative of energy with 
respect to structural perturbation was less than 1 .0 kcal/A; and, finally, the method of 
conjugate gradients, with Morse and cross terms until the final derivative of energy with 
respect to structural perturbation was less than 0. 1 kcal/A. 

The molecular basis for the vastly improved esterase activities observed 
was analyzed by molecular modeling of the peptidyl product inhibitor AAPF bound to the 
SBL-CMM L217C-SpGlc(Ac) 3 . While the substrate employed for kinetic analysis is 
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succinylated, this moiety was not included in molecular modeling since its orientation 
was not reported in the X-ray structure of SBL, suggesting high mobility in the crystal. 

As shown in Figures 17 and 18, molecular modeling analysis of L217C- 
Sf3Glc(Ac)3 revealed that the observed k c JK u changes correlate with the occupation of the 
Sj' pocket of SBL by the glucosylated -SPGlc(Ac) 3 side chain. In the minimized structure 
(Figure 17), the position of the glucose moiety was fixed by a network of hydrogen- 
bonding interactions (shown as white dotted lines) between the oxygen atoms on the C-3, 
4, and 6 substituents of glucose and water molecules in the surrounding external solvent. 
This extensive solvation directed the C-2 substituent of glucose internally towards the 
catalytic triad. In this orientation, the carbonyl oxygen atom of the C-2 acetate group acts 
a hydrogen bond (1.89 A) acceptor and stabilizes water molecule 127 in close proximity 
to the carboxy terminus of AAPF, used here as a substrate analog for modeling, and is 
shown in Figure 17 hydrogen-bonding to the O atom of the carboxylic acid. 

These results suggested that firstly the low amidase activity of L21 7C-S- 
Glc(Ac) 3 , which was 2-fold lower than WT, was a result of the S/ pocket being occupied 
by the glucose moiety at position 217. This prevents efficient binding of the pHA leaving 
group and, therefore, decreases the rate of acyl-enzyme intermediate formation which is 
the rate-determining step for amidase activity. Secondly, after the pHA has been 
displaced to form the covalent Acyl-Ser221 intermediate, the glucose moiety stabilizes a 
crucial, nucleophilic water molecule (Wat 127) in close proximity to the carbonyl carbon 
atom, through a hydrogen-bond to the oxygen of the C-2 acetate of glucose, as illustrated 
in Figure 1 8, This facilitates hydroylsis of the acyl-enzyme intermediate and therefore 
increases the rate of deacylation, which is the rate limiting step for esterase activity 
(Zerner et aL, J. Am. Chem. Soc , 86:3674-3679 (1964); Whitaker et ah, J. Am. Chem. 
Soc , 87:2728-2737 (1965); Berezin et al., FEBS Lett. , 15:121-124 (1971), which are 
hereby incorporated by reference). 

Glycosylation of SBL at sites within the active site can dramatically 
enhance its esterase activity. The library of glycosylated CMMs synthesized using the 
combined site directed mutagenesis and chemical modification strategy contains 22 
CMMs with greater than WT activity. Glycosylation at positions 62, in the S 2 pocket, and 
2 1 7, in the S,' pocket, gave the greatest increases in k c JK S} . The most active CMM 
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L2 1 7C-SpGlc(Ac) 3 had a kJK Kl that was 8.4-fold greater that WT and was the most 
active esterase synthesized using this approach. When surface exposed position 156 was 
glycosylated, there was little alteration in activity, and this demonstrates that the 
introduction of sugars at such sites has little effect on the catalytic activity of SBL. 

In addition to the tailoring of esterase k c JK M values, glycosylation also led 
to enormous improvements in specificity for ester versus amide hydrolysis, as determined 
by measurement of the ratio (kJK lU ) es ^ 1 (KJK K ^ md2LSe , E/A. This ratio has been 
increased to 17.2-fold greater than WT for L217C-SPGlc(Ac) 3 . Such enzymes are very 
attractive candidates for use in peptide synthesis, where a high esterase to amidase ratio is 
desirable. Furthermore, the CMMs described above have an even greater potential for 
this purpose, as the increases in E/As have been achieved by an increase in the catalytic 
efficiency of these enzymes towards ester substrates, in addition to a reduction in the 
amidase activity. 

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations can 
be made therein by those skilled in the art without departing from the spirit and scope of 
the invention which is defined by the following claims. 



